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PATTERN ANALYSIS AND CONFIGURAL SCORING 


PAUL HORST 
University of Washington 


PROBLEM 


Recently Meehl®? has called attention to the emphasis which clinicians place 
on the pattern of a client’s responses. In particular he considers the concept of pat- 
terning with respect to a person’s responses to the items in a test. He cites as an 
example two hypothetical dichotomously scored test items each of which has a phi- 
coefficient of zero with a dichotomous criterion such as a “‘schizophrenic vs normal” 
group. Numerical examples are provided to show that even with a zero validity for 
each item, it may still be possible to differentiate the two criterion groups on the 
basis of the responses to the two items. To achieve this differentiation, a person is 
credited with a correct response if both items are answered in the same way (either 
true or false), and with an incorrect response if they are answered differently. Meehl 
designates this type of scoring as “‘configural scoring’’, and suggests that it may pro- 
vide a basis for developing predictive instruments even though the separate items 
in the instrument may have negligible validity. This approach seems to offer the 
possibility of objective qualification to those elusive patterns of response which 
clinicians have regarded as important. 

On the assumption that each of a pair of items correlated zero with a dichoto- 
mous criterion, Meehl was concerned with the following question: “Having given 
the inter-item phi-coefficient for each of the two criterion groups then, if the items 
were configurally scored, what would be the correlation of this score with the dicho- 
tomous criterion?” He makes the assumption that the proportion of cases in the two 
criterion groups is fifty per cent, and that within each criterion group the proportion 
answering each item “‘true”’ is also fifty per cent. He then determines empirically 
the configural validity of the items for certain specified values of the inter-item phi- 
coefficients in each of the two criterion groups. 

The purpose of this article is twofold. One is to show the precise relationship 
between “configural’” validity of the items. The second is to point out that the 
conventional method of scoring a test consists of a linear combination of item scores, 


whereas the “‘configural” method proposed by Meehl is a special case of a non- 
linear combination of item scores. 


MATHEMATICAL RATIONALE 


Suppose we take the special case considered by Meehl where the two criterion 


groups are of the same size and fifty per cent of each criterion group mark each of 
the two items “‘true.”” We let 


#, = the phi-coefficient for the two items for sub group 1, e.g. the normal 
group 


#, = the phi-coefficient for the two items for sub group 2, e.g. the schizo- 
phrenic group 


@; = the phi-coefficient for the configural score on the two items with the 
dichotomous criterion 


Following Meehl, the configural score on the two items is “1” if both items are 


marked the same either “true ’’or “false” and a if either one of the items is 
marked “true” and the other is marked “false.” 


It can be shown then* that 7 is given by the equation. 
®, — BD, 
Vv 4-(% + 4)? 








dy = 


*See appendix for proof. 
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This formula explains rather simply the phenomena noted by Meehl with reference 
to the configural validity of two items. He found empirically that if @. = — 4, 
that is if ®, is equal in absolute value but opposite in sign to #,, then the configural 
validity would be equal to #,. The reason for this is evident from the formula. 


If &, = — , we have from equation (1) 
$,— (— 4%) 
v 4-(% - 4, )? 





coy = 





2 


et 4 


or simply 
f= (2) 


Meehl also found empirically that , — 4, gives a good estimate of ®; provided 
2 

the average or sum of %, and , is not too large. Formula (1) shows clearly why 
this must be true. As &, + 4, becomes larger the denominator term becomes small- 
er. Therefore, the number by which 4; — 4, must be divided becomes increasingly 
less than 2. It is also obvious from formula (1) that when ®, = 4, then ; must 
be zero. 

Table 1 gives the value of the configural validity for specified values of 4; 
and ®». 


TABLE 1. VALUES OF THE CONFIGURAL VALIDITY ®T CORRESPONDING TO SPECIFIED VALUES OF 
®, AND ®, 








%, 
-.8 : . : 0 


. 229 





| 
_ 
So 


Se 3 F . , P -612 
-140 . ‘ 4 : 503 
346 -. ‘ ‘ .302 
-.437 -. : : é 204 -437 
-.524 -. : -.200 -. ‘ ‘ .346 .500 
-.612 -. ‘ -.302 -. A ae -250 .420 
-.704 <-. ‘ -.408 -. -. -.115 0 .140_ .333 
-.800 -. -.612 -.524 -.437 -. -.250 —-.140 0 .229 
1.0 | -1.000 -.905 -.816 -.734 -.655 -.577 -.500 -.420 -—.333 —.229 0 


to mh > @ 


ar a <r) 


0 








We arbitrarily let columns of the table represent different values of #, ranging from 
—1 to +1. The rows then represent values of , ranging from —1 to +1. For example, 
suppose we wish to find the configural validity corresponding to a ®, of .4 and a 
®, of —.2. We look for the entry which lies in the column headed .4 and the row with 
—.2 at its left. This entry is .302. 

Figure 1 shows graphically the relationship given in the tables. Each curve 
represents for a specified value of &, the relationship between #7; and 4. As %, 
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Relationship between the configurel validity 
of two test items as a function of their g 
coefficients. (Sce text) 


$; 
Figure a 


approaches zero the relationship between @; and ®, becomes more nearly linear 
although even for 4, = 0, this relationship is not a straight line. It is also clear from 
the chart that as &, departs either positively or negatively from 0 the departure of 
the curves from linearity are greatest as #; approaches @,. For example, when 
®, = -1.0, the departure of the curve from linearity becomes great only as 9%, is 
close to -1.0. The same is true for #, = +1.0 and 9%, close to +1.0. This is, of 
course, the phenomenon which Meehl discovered empirically. 

Next let us see how we may regard “configural” scoring as a special case of a 
non linear combination of item responses. We start with the assumption that a score 
on a single item is either “1” or ‘‘0’’ depending on which of two ways it was answered. 
We shall not here consider items having more than two possible responses or item 
scoring procedures admitting of more then two different scores for an item. The 
principle outlined here can be generalized to such cases, but it is beyond the scope 
of this paper to do so. 
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Now the usual procedure for getting a total score from a group of items is simply 
to add the number of item scores, that is, the number of items answered “correctly.” 
First let us note that this procedure is but a special case of the more general linear 
combination of measures. If we have for a person a set of test scores (X:, X2;. . . Xn) 
on n different tests then we may get a linear combination T of these scores by assign- 
ing a weight to each score so that 


T = bo + bi X% + De Xe +... + Da Xe (3) 


where bp is an additive constant. In particular the b’s in equation (3) may be re- 
gression weights, determined in such a way as to give the best prediction of some 
criterion measure. But suppose now that we let bo in equation (3) be “0” and all 
the other b’s we take as ‘‘1’’. Suppose also that each test in equation (3) consists 
only of a single item in which the score is either “0” or “1”. Then T becomes the 
conventional test score consisting of number of items correct. 

But except for the sake of simplicity, there is no particular reason why we must 
get a total score based on a linear combination of measures. If we have only two 
tests, or items, for example, we could get a total score, as shown in equation (4). 


T = bo + bixi + boxe + biexixe (4) 


Here the total score is a linear combination of the scores on x; and x2 and also the 
product of the scores, xix. Suppose now we let bo = 1, bi = —1, be = —1, big = 2. 
Then equation (4) would be 


T = 1-x; —Xe + 2xixe (5) 
Now suppose that tests 1 and 2 are single items with possible scores of only ‘‘1” and 


“0”. If a person gets a score of “1” on both items, then x; = 1, x, = 1 and xix, = 
1x1 = 1. We would have in equation 5, therefore 


T=1-1-14+2=1 
Next suppose a person gets a score of “0” on both items. We would have x, = 0, 
X2 = 0, xixe = 0x0 = 0 so that for equation (5) we get 
T=1-0-0-2x0=1 


Now if a person gets “‘0”’ on item 1 and “‘1”’ on item 2, we would have in equation (5) 
T=1-0-1+2x1x0=0 


Similarly, if he gets “1” on item 1 and “0” on item 2, we would have 
T=1-1-0+2x0x1=0 


We see therefore that equation (5) is the precise mathematical formula for the 
special case of configural scoring described by Meehl.* It is also of interest to note 
that equation (5) can be written in the form 


T = 1- (x1 — x2)? (6) 
or what amounts to the same thing 
T = 1— (x2. - x)? (7) 


From either equation (6) or (7) it is easy to see that if both items have the same 
score then the term in parenthesis must be zero and T is “1”. If on the other hand 
one score is “0” and the other ‘‘1’”’, then the term in the parenthesis is either ‘+ 1” 
or ““—1’’ depending on which score is “0”. But in either case the square of this differ- 
ence is “1” which makes T equal to zero. 


*See appendix for proof that the weights used in equation (5) result in the maximum correlation 
of T with the criterion. 
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CONFIGURAL ScoRING BY THE “H” TECHNIQUE 

Another interesting example of a type of configural scoring of items is given by 
Stouffer, et al.@) They have developed a procedure for selecting and scoring a set of 
items which they have named “‘H technique.”” By means not relevant to our dis- 
cussion triads or groups of 3 items per group are selected. Each triad is then re- 
garded as a single item. The methods for selecting and scoring the triads purport 
to yield item triads for a scale which give a higher coefficient of reproducibility than 
is the case when single items are used and scored dichotomously. Extension and 
generalization of the procedure seems to have promise not only for clinical psychology 
but also for other areas concerned with psychological measurement and evaluation. 
Of particular interest in our discussion is the method of scoring the triads specified 
by Stouffer et al. The procedure is simply to assign a score of “1” if all or any two 
items in the triad are “‘correct’”’ and “0” if none or only one are “correct.” This 
again is an example of what we may call “configural” scoring. But let us see how we 
may express such a score as a special case of a non linear combination of measures. 
Suppose instead of a linear combination of measures given by equation (4) we con- 
sider for the case of 3 tests, a total score which involves also all possible products of 
scores. Such a score would be given by equation (8). 


T = bo + bixi + boxe + baxs + biexixe + bisxixs + bogxexs + biosxiXexs (8) 


Now suppose we let bo = bi = be = bs = 0, bio = bis = bes = 1, and bis = -2. 
Then equation (8) becomes 


T = xiXe + xiXs + XeX3 — 2x1X0x3 (9) 


It is easy to show that equation (9) is the precise mathematical scoring formula for 
the triads which gives a score of “1” if all or any two items in the triad are “correct”’ 
and “0” if none or only one item in the triad are “‘correct’’. Consider the various 
possible patterns of item scores for a triad below. 





Pattern Item No. Triad score 
a 


eo 





ONO WH 
cooroOorore 
CorocorroOre 
coocoocOCO}r RH ee 











If the x values in equation (9) are taken in turn from the successive patterns of item 
scores given in the table above, we have the following solutions for the equation. 
Pattern No. 


BAAR AHR AA 
munud uu da 
oooror}e- 
MoM KK OK RK OK 
CoOrororre 
+4+++++4++ 
cooCcrOorRrF- 
ee 
oroorrFK ore 
++++++++ 
COOrFOrFOrFe 
muuunruan 
coococOoOrR Ree 


Although we have given only two specific examples of how “configural” scoring 
techniques can be expressed in terms of precise mathematical formulas the methods 
may be greatly extended and generalized to cover almost any configural scoring pro- 
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cedure which might be proposed. No doubt, with good reason, there is currently a 
great deal of interest and activity in the analysis of patterns and configurations of 
responses as indicated by the review of Gaier and Lee“. It is highly probable, how- 
ever, that the work in this area will be much more fruitful when more precise and 
rigorous mathematical concepts are developed to take the place of verbal formula- 
tions and analyses based on empirical or trial and error manipulations of the data. 


SUMMARY 


A mathematical formula is presented showing the precise relationship between 
the ‘‘configural”’ validity of two test items and the inter-item phi-coefficients for each 
of the two subgroups of a dichotomous criterion. The formula predicts the phe- 
nomena discussed by Meehl °. Two examples are taken, one from Meehl and one 
from Stouffer“ to show how exact mathematical formulas may be used to express a 
“eonfigural” score. It is suggested that tbe analysis of patterns or configurations of 
responses may become much more fruitful with the introduction of non linear math- 
ematical formulations and procedures. 


APPENDIX: ProoF or EQUATION 1 AND 5. 


Suppose we have two dichotomous independent variables, X and Y, and one 
dichotomous dependent variable, Z. Consider also a third independent variable 
which is the product XY. All correlation coefficients among the variables are to be 


taken as product moment correlations and since all variables are dichotomous they 
will be phi-coefficients. We let 


1 xy Pxcxy+) 
$= Pxy 1 Pyixyx) (1) 
xxv Pyvaxry 1 


Pxz 
é, = byz (2) 
| Pane _ 

where the ®’s are phi coefficients between the variables indicated by the subscripts. 
We assume now that 

Oxz = Pyz = 0 (3) 

Mx = My = Mz = 1/2 (4) 
Because of 4 it can be proved that 

Pyxy) = Pxay (5) 


If we let 4; be the multiple correlation of the variable Z with X, Y, and XY, it is 
well known that , 


#. = o) 6, (© 
Because of 3, 5 and 6 we can show that in scalar notation 
1 -— dy 


b> - Pixxyz 
1 + 2yx Piny — Way —- Pky 





3. “= Pixyz (1 + xy) (7) 
(1 + ®)xy — 2% ay 
Now the general formula for a product moment correlation coefficient is 
, is Cy —- MM, 


O;oj 
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where Cy is 2 (8) 


From 7 and 8 





Cxy a MxMy ) 


oxy 


) _2 ( Cxexy — MxMxy y (9) 











OyxOx 
But because of 4 and since all variables are dichotomous 
Ox = dy = Oz = 1/2 (10) 
Also we can show that 
Cxawn ss Cxy = Mxy (11) 
Using 4, 10 and 11 in 9 
@? 
3 sie (XW)Z 


1- Coe (12) 
Qoxy 


Or applying the general form 8 to 12. 
2(C — MxyMz)? 
@2 an (XY)Z xXy+Viz 13 
oh (20ky — Cxx) si 
Now because X and Y are dichotomous it is easy to show that 
oXy = Cxy- Ciy (14) 
Substituting 14 in 13 and using 4 and 10 


_ 4(2Cocns - Cxx)? 
ot = — Ga - Cay) _ 








Because of 4 
C C 
Car = x,%; XQYo (16) 





where Cx )¥, and Cx,y , refer to the two subgroups respectively of the dichotomous 
criterion. 


Also because of 4 


Cayz = Cones of Cog¥2)ty (17) 





But since all Z; values are “1” and all Z, values are 0, we have Cay -_ * 0 and 
17 becomes 


C 
X,Y, 


C xnz= 5 (18) 


Substituting 16 and 18 in 15 


(Cx,y, Fi Cx,y,)? 
$3} = (19) 
(Cxx, + Cx,x,) [F- (Cx,x, + Cx,x)| 


Now assuming that each Z group was equally divided on its dichotomous responses 
to both the X and the Y variables, it can be shown that 
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gs ea 
1/4 


Cx,y, = ae 1 (20) 


where ®; is the phi-coefficient between X and Y for the group with scores of 1 on Z. 
Similarily if we let , be the correlation of those with scores of 0 on Z, then 


Cx,y, = a (21) 


Substituting 20 and 21 in 19 gives 


o% = ( 4,— 2)? 
(2+ + @)[2-(4+ &) ] 





(22) 


®,— 
Vv 4-—(% + %)? 
which is the same as equation 1 in the text. 





Py 





(23) 


We have next to find the regression coefficients in the equation 
t = Bo + Bix; + BeX2 + BwXiX2 (24) 


Since By is an additive constant and will not affect the correlation of ¢ with the 
dichotomous criterion we shall consider first the solution for the other regression 
coefficients. Instead of the matrix and vector of correlations in 1 and 2, we consider 
the corresponding variances and covariances as in 25 and 26 respectively. 


co) ©6Pxy—s Puy | 
P = Pxy ot Pycay (25) 


Pxay Pyxy ory 


Also we let 
B 


Then it is well known that 
B = PP, 


But because of 3, 5 and 10 it can be shown that in scalar notation 


B, = Pxay (Pxy ny o*x) Pamnz 
1 om ms 


B. = Pxcy (Pxy yt o*x) Pawz 
ite D 
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B ine (atx x P*xy) Payz 
” D 
where D is the determinant of P. 


We let 
K = (ox a Pxy) 


Substituting 30 in 29 gives 


Bi = —PxanK 
Bz = —Ixcxy) 


Bu = (ox + Pxy)K 
But by definition 


Pxcxy) i Cxcxy) ) MxyMy 
Pxy = Cxy - MxMy 


Using 4, 10, 11 and 32 in 31 
B, = —Cxy(1 —- Mx)K 
Be = —Cxy (1 -_ Mx)K 
By = Cxy K 


_ K Cxy 
Pa 2 
Substituting 4 and 34 in 33 
By -G 
Bz —G (35) 
By = 2G 
Substituting 35 in 24 
t = Bo-— G(X; + Xe — 2X1X2) (36) 


Now any linear function of t will correlate the same with Z as t itself. Hence we 
write 


G 


T=At+E (37) 
where 


A 1/G 
E 1-— B/G (38) 


From 36, 37 and 38 
T = 1-—X,—- X2 + 2X.X2 (39) 


which is the same as equation 5 in the text. We have therefore proved that the 
weighting of the variables X;, Xs, and X,X_ indicated by 39 results in the maximum 
correlation with the criterion. 
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A FACTOR ANALYSIS OF CURVILINEAR DISTORTIONS ON THE 
BENDER-GESTALT! 


WILSON H. GUERTIN 
Veterans Administration Hospital, Knozville, Iowa 


INTRODUCTION 

The Bender Visual Motor Gestalt Test“ which was introduced in 1939 has 
found ready application with mental hospital patients. Interpretations are largely 
intuitive and based upon clinical experience with the instrument. Billingslea®?, 
Hutt“, and Pascal and Suttell have worked toward more unified scoring and’ 
interpretive schema. In a previous paper by the author“? an attempt also was made 
to see if some systematic order could be developed out of the relationships among 
scoring items through the application of factor analysis. Five fairly meaningful 
factors were extracted at that time from a rather sparse sampling of scoring cate- 

ories. 

. One cluster of scoring categories embodied curvilinear distortion and rotational 
tendencies. Since the corresponding factor was most heavily loaded with mixed and 
catatonic schizophrenics, the hypothesis was suggested that poor emotional control 
underlies certain types of distortions. Furthermore, since the cluster of variables 
seemed heterogeneous, it was expected to be subject to further simplification by sel- 
ecting pertinent variables and subjects. The present report is of a study attempting 
to include all the scoring categories that might be particularly related to this cluster. 


PROCEDURE 
The sample consisted of 100 white mental patients at several hospitals, 26 
females and 74 males. These subjects were classified grossly for statistical conven- 
ience into five diagnostic classifications. The distribution of subjects by diagnostic 


groups was: 13 non-schizophrenics, 23 paranoid schizophrenics, 15 catatonic schizo- 
phrenics, 14 hebephrenic schizophrenics, and 35 mixed schizophrenics. The non- 
schizophrenic group included character disorders and organic brain-damage cases 
in about equal proportions. 

A total of 46 scoring variables were studied, four of which had to be rejected 
because of an inadequate number of cases in the tails of the distributions. Nine of the 
variables retained showed tails with frequencies of occurrence of 11 to 20 percent of 
the cases. The scoring items employed stem from distortions encountered rather 
frequently in the productions of hospitalized schizophrenics. Thus, it can be seen 
that the study is directed toward an understanding of the factors underlying the pro- 
ductions of schizophrenics rather than a more generalized group of patients. Further- 
more, only those deviations were scored that were sufficiently gross distortions to be 
recognized without measuring techniques. 

The information from the data sheets on each subject were transferred to McBee 
Punch Cards for ease of tabulation. Tetrachoric correlation coefficients were graph- 
ically estimated from Thurstone’s Tables. The multiple group centroid method of 
factor analysis“? was employed because it is time-saving and likely to produce em- 
pirically meaningful clusters“). The clusters which determined these multiple group 
factors were identified by a conventional method which produces fairly tight and 
relatively independent clusters. Communality estimates were taken as the highest 
coefficient in each column except for those tests in a cluster. The communality estim- 
ates for cluster tests were taken as the highest intercorrelations between the particu- 
lar tests and any other member in the same cluster. The communality accounted 
for by each of the variables was arrived at from an orthogonal matrix which was ob- 
tained by rotating the oblique factor matrix. 


‘From the V. A. Hospital, Knoxville, lowa. The author is indebted to Dr. M. C. Addington of the 
Clinical Psychology Section, V. A. Medical Teaching Group, Memphis, for providing a portion of the 
tests herein employed. 
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RESULTS 


While extremely high correlations were absent from the intercorrelation matrix, 
six clusters were quite easily identified and appeared to be relatively independent of 
one another. The six clusters determined the oblique matrix with the six factors ab- 
stracted simultaneously. This resulting six-factor oblique matrix appears as Table 1. 
Intercorrelations between factors were quite low. The highest were -.32 between 
A and C, .32 between A and D, and -.30 between B and E (factors are lettered in 
their order of appearance in Table 1). 
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TABLE 1. Opiique Factor Matrix*—(Continued) 
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Laij? = 16.72 











*Decimal points omitted in all tables. 
**Based upon orthogonal matrix not reported here. 


The total variance accounted for by the six orthogonal factors constitutes 40 
percent of the total variance of the intercorrelation matrix or 78 percent of the 
estimated communality for these 42 variables. The estimated communality for the 
intercorrelation matrix was 51 per cent of the total variance. 
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TABLE 2. Factor LoapInes ror DiaGNnostic Groups 
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Loadings on these derived factors for diagnostic groups are shown in Table 2. It 
will be noted that there are a fair number of moderately high loadings. Some assist- 
ance in understanding the derived factors can be sought in their relationships to the 
previously reported Bender-Gestalt factors“). Mean correlations for cluster items 
of the previous study with the clusters in this study led to the correlations given 
between them in Table 3. 


TABLE 3. INTERRELATIONSHIPS BETWEEN OBTAINED FAcTORS AND THOSE FROM Previous Srupy - 
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DIscUssION 

This study, like the previous one, suggests that Bender-Gestalt performance can 
be understood fairly well in terms of common factors, since the communality of the 
intercorrelation matrix is slightly greater than half the total variance of the matrix. 
Such findings suggest that it would be profitable to employ clusters (or factors) as 
scoring categories, instead of using individual distortion items, when analyzing the 
performance of mental patients on the Bender-Gestalt. 

Rotations were not attempted inasmuch as the conventional cluster analysis 
employed would lead to a first approximation to simple structure. The apparent 
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meaningfulness of those clusters is further evidence that the positions of the reference 
axes are satisfactory. 

The six factors have been named inductively by considering the symptoms which 
showed the highest factor loadings. 


The Unstable Cloture Factor is confined almost entirely to Bender figure 5. One 
might suspect that this factor is largely an experimental dependence item, but the 
moderately high loading of .39 for catatonics on this factor suggests that it may be 
something more. It is characterized by a disapproximation of the lips of the cup in 
figure 5, which tends to reduce the likelihood of making the cup symmetric because 
of the larger distance spanning between the edges of the cup. Just why an individual 
would avoid approximating the edges of the cup and thus produce an unstable and 
asymmetric figure is not clear. It would appear that some emotional elements oper- 
ate to produce such an unstable configuration. The catatonics are the only ones 
positively loaded with this factor and it seems likely that this factor is related to an 
underlying general emotional instability. The highest negative loading is among the 
hebephrenics, and while they present emotional instability, it is more in the nature 
of a directly observable situational response rather than a general disturbance of 
emotional balance. 


The Curvilinear Distortion Factor is characterized by executional inaccuracy of 
curvilinear drawing. It may well be that the execution of curvilinear drawing has 
some sensual qualities which provoke impulsive responses. Thus, it is proposed 
tentatively that these curvilinear distortions are caused by impulsiveness with a 
possible emotional basis. This factor corresponds rather closely to a factor from the 
previous study which was concerned with the careless execution of more general 
aspects of the Bender-Gestalt. Further study will be required to see whether the two 
factors can be differentiated from each other. 


The Propensity for Curvilinear Movement Factor II seems to be an essentially 
psychotic factor. Not only are rotations involved, but a generalized propensity for 
curvilinear movement seems to operate in the drawing of the two minor arcs on figure 
4. These are well executed by individuals possessed of this factor inasmuch as they 
find a certain freedom in curvilinear expression with inherent change of direction. 
This facilitates the reproduction of each are so that they are proportionate to one 
another. The emotional disorganization and affect display which seem to underly 
this factor are reflected in the loading of the hebephrenics on this factor. 


The Fragmentation Factor also appears to be an essentially psychotic factor. 
It is characterized by a disjoining of the elements of the figure and either represents 
a misperception of the elements as being separated or a primitive defense to avoid 
feeling through dissociation. This factor seems to be primarily a factor concerned 
with defenses against feeling, perhaps largely against anxiety rather than emotions. 
Psychiatrically, this type of defense would be characterized as dissociation, except in 
the chronic deteriorated schizophrenic for whom the term dissolution would be 
more apt. 


The Irregular Propensity for Curvilinear Movement Factor seems to be a higher 
form of defense against feeling. We find that it is not unlike the Poor Spatial Con- 
tiguity Factor found in the previous study, which was encountered in character dis- 
orders and neurotics. These individuals seem to be experiencing conflict over emo- 
tions which are unacceptable to the ego. They frequently respond with neurotic-like 
mechanisms against these feelings, such as reaction formations. Despite the descrip- 
tion of these as “neurotic-like defenses” it is seen in Table 2 that the mixture of 
organics and character disorders (“‘non-schizophrenics’’) do not utilize these particu- 
larly, with the highest positive loading among the paranoids. Hebephrenics do not 
have the intellectual capabilities requisite for employing such high-level defense 
mechanisms and, as expected, are highly loaded negatively on this factor. 
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The Experimental Dependence Factor seems to stem entirely from inter-relations 
of scoring items on figure 4. Since the arcs were judged with respect to one another 
as to whether they were disproportionately large or small, the largest portion of the 
common variance is attributable to the scoring and the mutually exclusive possibil- 
ities inherent therein. However, a rather specific factor related to the disproportion 
between the small major arc and the large minor ones seems to have some importance 
for the group of mixed schizophrenics. Its meaning is most difficult to evaluate from 
this study alone. In general, it would be wise to disregard this last factor until it . 
may reappear in a future study. 


In summary, it may be stated that this attempt to factor analyze variables re- 
lated to curvilinear movement has disclosed at least some factors different from those 
obtained in a more gross analysis. There is reason to suspect that these obtained 
factors hold more diagnostic significance for the evaluation of emotions than do the 
factors previously reported“. The psychological meaning of these factors, unfor- 
tunately, is less clear than in the previous study. Only further cross validation with 
internal checks for psychiatric symptomatology ¢an provide sound principles of in- 
terpretation for the Bender-Gestalt productions of mental patients. Those psycho- 
logical significances attached to the factors in this paper are highly tentative, and 
while suggestive, should not be applied carelessly in clinical situations. 


SUMMARY AND CONCLUSIONS 


1. The previous factor analysis of the Bender-Gestalt “ produced a Propensity 
for Curvilinear Movement Factor I which seemed to be related to emotions. The 
present study was directed toward a further evaluation of propensity for curvilinear 
movement through a more restricted sampling of scoring items and patients. 


2. The Bender productions of 100 mental hospital patients were evaluated in 
terms of a number of semi-objective scoring indices. The intercorrelation matrix of 
these variables was factored with the multiple-group centroid factor analytic tech- 
nique. Factor loadings are presented for 42 test variables and five gross diagnostic 
categories. Relationships between the factors obtained from the current study and 
a previous one are also presented. 


3. The five oblique factors obtained which seem to have meaning are: the Un- 
stable Cloture Factor related to general emotional balance, the Curvilinear Distortion 
Factor related to impulsiveness, the Propensity for Curvilinear Movement Factor II 
related to emotional disorganization and affect display, the Fragmentation Factor 
related to dissociation or disorganization, and the Irregular Propensity for Curvilinear 
Movement Factor which is related to emotional conflicts and neurotic-type defenses 
against feelings. 


4. The results of the study were regarded as encouraging, and it is felt that an 
integrating study incorporating behavioral data may lead to a much sounder diag- 
nostic approach to the Bender-Gestalt productions of mental hospital patients. 
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A STORY COMPLETION TEST FOR COLLEGE STUDENTS 


EUGENE 8S. MILLS 
Whittier College 


In an earlier article“ the writer discussed a story completion test which was 
designed for individual use with children. The originator of this test found the 
story completion technique to be profitable clinically and this has been supported 
by subsequent application to children of different ages. 

In view of the fact that psychologists must deal with large numbers of college 
students and that they often must obtain material which will assist in making eval- 
uations of student personality, it would appear worthwhile to attempt to adapt the 
story completion technique to the age level and life experiences of the college student. 
It would seem particularly promising if the technique could be shown to work effect- 
ively in group testing. 

In order to try out the basic story completion technique in group testing at the 
college level, the writer composed over fifty story completion items dealing with 
various critical areas of student life. In deciding upon the areas of student life to be 
represented in the items, a study was made of several research projects which dealt 
extensively with the life and problems of college students“: ?- *. 5. 8), The seven gen- 
eral areas under which the items might be subsumed were: (a) home, (b) academic, 
(c) social, (d) vocational, (e) religion and philosophy of life, (f) athletic, and (g) 
health. The structure of the items was similar to that of the Madeleine Thomas 
Completion Stories Test items. 

Different combinations of items were administered to psychology students in 
two liberal arts colleges. An attempt was then made to select those items which 
analysis showed to be most productive. Many items were eliminated because of the 
perfunctory or stereotyped responses which they seemed to elicit. In general, how- 
ever, it was found that the students took the items seriously and responded in an 
open and spontaneous manner as long as the test was introduced with the proper 
guarantees as to the scientific and confidential nature of the material. 

After having experimented with each item, a test of fifteen items was estab- 
lished closely paralleling the Madeleine Thomas Test. The criteria for inclusion of 
an item were: (a) it must be maximally productive, and (b) it must complement the 
other items in the test. The test items included produced, as a unit, a wealth of 
material when administered to a final group of history and psychology students. 


Tue TEST AND 1TS ADMINISTRATION 


Following are the fifteen items which the writer has found to be most pro- 

ductive in group testing of college students: 

1. A student is in college. He! writes a letter home. What does he say in the 
letter? 

2. This student answers a question incorrectly in class. In the hall after class 
another student says to him: ‘‘You sure messed that one up, didn’t you?”” What 
does he do? 

3. The student calls at the college post office where he finds a letter waiting 
for him. It is from his parents. What does it say? 

4. The student has been to a movie with his best girl friend. After the date he 
returns to his room in the dorm. How does he feel? What does he do? 

5. The student has been going steady with a certain girl. One night she re- 
fuses to go out with him. Why? What does he do? 

6. The student tries out for the football team’. What happens? How does he 
feel? 


1In use With women students it is helpful to make proper substitutions for the sex of the hypo- 
thetical student in the stories, 

2In use with women students the substitution of the term ‘“‘volley ball team’’ facilitates the re- 
sponse to this item. 
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7. The student receives his final grades for the semester. How does he feel? 
What does he do? 
8. The student has been sick in the college infirmary for a few days. After 
giving him an examination, the college physician says....? 
9. Itisnight, the student is in bed. the day is over. What is he thinking about? 
10. Then he goes to sleep. What does he dream about? 
11. The student is now a senior. Is he eager to graduate or would he rather 
remain in college for a while? Why? 
12. What will he do after he graduates? 
13. Among all the novels and stories ne has heard and read, which one does he 
like the best of all? Why? 
14. If you were to write a book, what would you write about? 


15. Do you remember your early childhood? What is the first thing you can re- 
member now? 


In preparing the test for use with groups of students it has proven helpful to 
spread the items over several pages, allowing ample space between each item (and 
between each section of the items) for the student to write his answers. It has also 
seemed necessary to urge the student to extend his answers to the reverse side of the 
pages when necessary, after having indicated the number of the item he is answering. 

In using the story completion test with groups of students much of the value of 
the test depends upon the extent to which the students have been prepared for the 
test items. It seems desirable to talk informally with them prior to the test about 
the nature of the task, giving them assurances of the confidential nature of the mat- 
erial, and urging them to enter freely into the stories. In practice, the writer has 
made introductory comments such as the following: 

You are going to be given some pages on which are written a number of short stories or items 
about a college student. Each story describes a situation and asks a question about that situa- 
tion. You are to read the story item and then complete it in any way you wish, extending your 
completion of the story to the back of the page if space allowed is inadequate. "When doing so, 
however, please indicate the number of the story you are completing. Please feel free to use your 


imagination in completing the stories because there are, of course, no right or wrong endings 
for them. 


Metuop or ANALYsIS oF Test RESPONSES 


The technique which has been used for analysis of test responses is essentially 
the same as that devised by the writer“ for analysis of the Madeleine Thomas Test 
and is similar to that used by Symonds“? in his research on adolescent fantasy. In 
essence, the task was one of theme analysis of all story completions. The writer’s 
experience has been the same as Symonds who found that, “No ‘system’ seemed to 
be wholly satisfactory. Nothing took the place of giving the task one’s individual 
attention, keeping as many details as possible in memory, and maintaining a flexible 
attitude” © P. 71), 

In analyzing the fantasy material produced in response to the stories, the writer 
removed all notations which served to identify the subject who had produced those 
responses in order to minimize the possibility of prejudice in his evaluation. A 
general survey was made of all responses in order to become acquainted with the 
subject’s style of writing and his orientation toward the test stories. Following this, 
each story completion was read slowly, particular attention being paid to recurring 
themes, omissions, confused writing, witty comments, and other indications of dis- 
turbance. Recurring themes and other qualitative observations were noted in .the 
margins next to the story or stories concerned. 

After each response had been thus studied, a further general survey of the test 
as a whole was made in an attempt to relate notations to the over-all body of the 
test. Following this operation, the responses were reread in reverse order and further 
notations made as necessary. Two days later the writer came back to the test 
material and repeated the above operations. An attempt was made to distill further 
the qualitative comments which had been made previously, sharpening the nota- 
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tions to describe more accurately the dynamic qualities of the responses and elimin- 
ating unnecessary duplication in noting themes. 

It must be stated, of course, that the writer has not always been certain as to 
how to characterize specific themes. Often the projective material is such as to defy 
classification, while at times it is difficult to decide whether or not material should 
be listed as two separate themes or as one theme. While Symonds“ chose to list 
themes only when they appeared in the projective material at least three times, the 
writer has made no such limitation in recording theme material. In effect, it often 
appears that a particular theme occurs just once in a subject’s test, but that in light 
of other test indications such a theme constitutes a significant element in his ad- 
justment. 


RESULTS 


At the present time the test has been administered to 312 history and psychology 
students in three liberal arts colleges and the responses analyzed in accordance with 
the general technique just discussed. A total of 6,864 themes have been tabulated 
according to the number of the story item where they appeared. Table 1 presents the 


percentages of themes of different types which appeared in completion of the separate 
items. 


TABLE 1. PERCENTAGES FOR THEME ANALYsIS OF RESPONSES TO Eacu Story ITEM 
(FicurEs RouNDED TO THE NEAREsT 1%) 
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It is apparent from this table that a variety of themes was present in the res- 
ponses to the test and that certain items are more likely to elicit a particular theme 
(or themes) than are other items. Examples of this are seen in the case of items 1 
and 3, in which approximately one-half of all themes concern family relationships, 
and items 11 and 12, which produce a wealth of material concerning vocational mat- 
ters. While both items 1 and 3 elicit material dealing with family relationships, the 
different approach in the two items makes possible an instructive comparison of the 
student’s attitudes with what he feels to be the attitudes of his parents. Since family 
relationships are of great importance in the lives of most college students, this test 
condition facilitates an understanding of many student problems. Similar inter- 
item comparisons are possible in other critical areas of adjustment. 

There is, of course, considerable difference in the time required for students to 
complete the story items. On 246 tests which were timed to the nearest minute, the 
range was 7 to 59 minutes. It has been observed that the more'seriously disturbed 
students tend either to complete the test in a relatively short time or, as is sometimes 
the case, to spend an excessively long time on the task. 

A limited test of this observation was possible with the data at hand. Rorschach 
records to which the Munroe Inspection Technique had been applied were available 
on 43 cases. On the basis of the size of Munroe scores, the writer ranked the records 
from the most seriously maladjusted to the most favorably adjusted. By arbitrarily 
selecting the top ten, which Munroe scores revealed to be the most poorly adjusted, 
and plotting these ten cases against the time taken to complete the story completion 
test, it becomes apparent that they tend toward the extremes. Of the ten cases 
studied, six took fifteen minutes and under to finish the test, while the remaining 
four took forty-five to fifty-nine minutes. The time required by these ten cases 
stands in rather striking contrast to the mean time of 32 minutes for the 246 cases 
tested. This suggests that the time factor may, in itself, be a significant clue in the 
use of such a technique. 


ILLUSTRATIVE CASE 


The test has been used primarily with groups of college students; however, it 
has also been helpful to the writer in individual use with students who are experienc- 
ing difficulty in adjusting to college life. Once rapport has been established with the 
disturbed student the administration of the story completion test by the counselor 
in a face to face relationship offers a convenient entry into the student’s problems. 
The experience of providing stories for the items often facilitates the recognition and 
discussion of critical areas of adjustment. An example of this is found in the case of 
a 20 year old student who came to the writer complaining of a continuing inability 
to decide upon a vocation. Following a brief period of introductory discussion the 
story completion test was administered orally to the student. Answers to three of the 
items illustrate the useful material obtained by this method. 


Item 1. (A student is in college. He writes a letter home. What does he 
say in the letter?) ‘I guess he says, Dear Folks: I’m getting along O. K. Uh... 
my studies are O. K., too, but I can’t seem to get an idea of what to do with 
them. Here I ani—I’m in college and I don’t know what I want to do yet. I 
like my science courses but I make better grades in English and Drama. Please 
don’t lose patience with me... I’ll try to get situated soon. I could use some 
more dough. Love, Johnnie.” 


Item 3. (The student calls at the college post office where he finds a letter 
waiting for him .. . It is from his parents. What does it say?) ‘“Dear Johnnie: 
Er... I got your note. Sorry you haven’t had any luck getting your mind made 
up yet. We're paying for your. . . we’re paying your way and naturally we’d 
like to see some progress on this. Your mother and I hope you will keep in 
mind...uh... our talk at Christmastime. Love, Dad.” 
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Item 11. (The student is now a senior. Is he eager to graduate or would 
he rather remain in college awhile? Why?) ‘‘He’d like to, I guess, get out. . . but 
he also doesn’t look forward to the big push after the protection of college walls. 
You know . . . ivory towers and all that stuff. He’s got to make up his mind 
soon... or someone else will make it up for him.” 


In this case the story completion responses provided much insight into the 
difficulties besetting the student. It was possible to infer from the responses that the 
parents were significantly involved in the case. The talk mentioned in response to 
item three was a clue which was followed up in a subsequent interview. In this case 
the parents had been pressuring their son to study law in the expectation that he 
would some day join his father’s law firm. During the previous Christmas vacation 
the father had promised his son a new car if he would apply for admission to law 
school. The degree of anxiety and concern which the student felt was amply demon- 
strated in the story completion responses along with valuable clues as to prevailing 
conditions in his case. 

The story completion test does not appear highly threatening to most students. 
For this reason it has been useful as a first test in those cases where considerable 
testing seems indicated. Besides providing valuable information early in the re- 
lationship, it helps to clear the way for the Rorschach, T. A. T., and other tests 
which usually constitute a greater threat. 


SUMMARY 


The writer discussed a story completion test for college students consisting of 15 
items dealing with critical areas of student life. The test was administered to 312 
history and psychology students in three liberal arts colleges and a theme analysis 
made of the responses. The time required for administration of the test was dis- 


cussed and consideration given to the value of the technique as an opening wedge 
in testing and counseling relationships. 
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PERSONALITY INTEGRATION AND THE PERCEPTUAL PROCESS! 


ELI A. RUBINSTEIN 
Veterans Administration, Washington, D. C. 


PROBLEM 


This investigation concerns the influence of personality integration on the per- 
ceptual process by examining the relationship between personality integration, as a 
quantitative aspect of personality dynamics, and the perceptual process. Is the per- 
ceptual process increasingly influenced by internal, personal factors of the perceiver 
as personality integration decreases? 

Previous studies in this field relate to this investigation more by implication 
than directly “: 2.3.5.7, In their sum total they lend support to the hypothesis that 
degree of personality adjustment influences perception, since the greater the person- 
ality maladjustment the more probably there will be strong somatic needs, intense 
personal values and attitudes, or strong emotional tensions to influence the percept. 

We can hypothesize two broad continua entering into the determination of 
relative weights of internal and external factors in the perceptual process. On one 
hand, the degree of structuration of stimulus should largely underly the influence of 
the external stimulus. On the other hand, the degree of personality integration should 
largely underly the pressure of the internal factors. Given a series of stimuli of 
graded ambiguity, the greater the structuredness, the greater the conformity in re- 
sponse. But, at the same time, given a group of perceivers, the greater the personal- 
ity maladjustment, the less the conformity in response (assuming the maladjustments 
are not all identical in nature.) If these hypotheses are valid, we should be able rough- 
ly to predict the result of presenting a series of stimuli of graded ambiguity to a 
group of individuals of varying personality adjustment. We would expect that the 


greater the structuredness of stimulus, the greater the conformity of response. But, 
in addition, we should expect to find, as a modifying effect in this framework, that 
the greater the personality disturbance, the less the conformity of response. The 
better the personality adjustment (and, therefore, the less the influence of individual 
internal factors), the earlier in the presentation of the series of graded stimuli will 
we receive the “expected” or conforming response. 


EXPERIMENTAL DESIGN 


Stimulus Material. A series of incomplete line drawings of simple objects was devised 
and three graded stages of ambiguity were produced by successively stripping the 
incomplete drawings of key parts until only the barest implication of the drawing 
was left. Thus, in a sense, the series was developed backwards. As actually pre- 
sented to the subject, each individual line drawing was on a card, 36 cards in all, and 
each card was shown to the subject for as long as needed to elicit a response. The ex- 
perimenter had, at the same time, a recording sheet on which were reproduced the 
line drawings. He roughly sketched the response on the sheet in accord with the 
explanation of the subject, who was asked to trace his response, with his finger, on 
the card. Thus, no actual drawing was done by the subject. All first stage presenta- 
tions were shown, then all second stage and then third stage presentations, rather 
than running through the three stages for each object successively. 

The content of this graded-in-structuration series of line drawings was con- 
sidered from two aspects. The attempt was, first of all, to have objects which, in 
complete form, have fairly definite and well-known configurations. Objects which 
lent themselves well to simple line representation were considered. In addition, the 
range of content was restricted to everyday, ordinary objects so as not to complicate 


*This article is based partially on a doctoral dissertation submitted to The Catholic University of 
America. Grateful acknowledgments are due to Dr. William D. Commins for his assistance in the 
original study, and to Dr. A. C. Cornsweet for his cooperation in the continuation of the research. 
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the problem by the influence of cultural differences. As finally adopted, the set of 
stimuli included; a shoe, a hand, a profile, a fish, a dog, a butterfly, a watch, a tree, 
and a house. Thus, there were three human details, three animals, and three com- 
mon objects. 

It should be noted that this totals only nine presentations whereas the series 
actually contained 36 stimuli or 12 to each of the three stages. The additional three 
presentations per stage consisted of line drawings which became successively more 
complex, but at the same time less, rather than more, structured or unambiguous. 
Whereas the first nine drawings increased in complexity per stage toward a specific 
representational configuration, these three “nonsense’’ presentations increased in 
complexity away from any specific, well-known representation rather than towards 
such design. These three nonsense stimuli were introduced to hinder the establish- 
ment of a mental set in which the subject would, upon presentation of the second 
stage drawings, soon realize the increasing structuration and possibly withhold doubt- 
ful responses in anticipation of the even more structured third stage. It was also 
in order to prevent this mental set that the cards were presented as they were, rather 
than running through the three stages for each object successively. Furthermore, 
the cards were changed in sequence from stage to stage. 


Subjects. In order to eliminate various cultural and environmental differences as 
much as possible, all the subjects were white male war veterans. On the possibility 
that organic brain damage might influence the test results, all brain damage cases, 
cases having undergone shock treatments of various kinds, or cases with any central 
nervous system disturbance were excluded from the study. Also, the Wechsler- 
Bellevue Intelligence Test was given to all subjects so that the influence of intellect- 
ual factors could be determined and statistically handled. 

The sample consisted of 125 cases distributed over a fairly wide range of degree 
of personality adjustment. The subjects included patients of two mental hospitals, 
two mental hygiene clinics, and veterans applying to a university counseling center. 


Criteria of Personality Integration. It was decided to use a single, over-all rating scale 
for the purpose of getting a measure of personality integration. A brief statement 
was constructed establishing the frame of reference in which the rating was to be 
made. Using that as a basis, a single five-step rating scale was devised as follows: 
1. No indication of sufficient conflict, maladjustment, or personality disintegration to cause the 
patient to be considered abnormal. 
2. A mild degree of conflict, maladjustment, or personality disintegration which prevents the 
patient from functioning on a completely adequate level. 


3. A moderate degree of conflict, maladjustment, or personality disintegration which signifi- 
cantly restricts and makes unsatisfactory the patient’s relationship with himself and his normal 
environment and which strongly suggests the need for some form of readjustive pr ess. 


4. A marked degree of conflict, maladjustment, or personality disintegration which makes it 
impossible for the patient to effect any adequate relationship at the present time with his normal 
environment without some comprehensive readjustive process. 


5. An extensively severe degree of conflict, maladjustment, or personality disintegration which 
necessitates institutionalization or special therapeutic and custodial care, since even minimal 
socially adequate functioning is not possible on an independent basis. 


This rating scale was filled out three times for each subject. In each case where 
a psychiatrist had extensive personal contact with the individual (this was the case 
for 66 individuals) the psychiatrist was asked to rate the individual. The psycholo- 
gist who tested the person and wrote a report on him was asked to fill out another 
rating. The author, independent of the other ratings, reviewed the case history or 
social history of each individual and rated him a third time on the five point scale, 
taking into account such factors as: nature of illness, if any, length of illness, length 
and number of hospitalizations, nature of onset, description of behavior, and any 
other pertinent data presented. In addition to these ratings, each individual was 
given a “Basic Rorschach Score” based on the Buhler-Lefever “ check list. 
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RESULTS 


Rating of Personality Adjustment. The product-moment correlation between the 
combined ratings and the Basic Rorschach Scores was .56, and showed that both do, 
to a considerable degree, measure the same thing. It was then decided to combine 
all four measures, the three ratings and the Rorschach Score, so as to have a com- 
bined rating for each individual. This was done by converting the Basic Rorschach 
Score distribution into five equal percentage groups to agree with the five-point 
rating scale and pooling all four scores into one total score. 

The reliability was determined by average intercorrelation of all ratings as 
shown in Table 1. Guilford’s Tables“: ». ®® were used for converting correlations 
to Fisher’s Z, averaging the Z’s and then reconverting to a correlation coefficient. 
The average intercorrelation for all six correlations (three estimated uncurtailed dis- 
tributions for the psychiatric ratings) was .75. This was one estimate of the relia- 
bility coefficient for the four sets of ratings. 


TABLE 1. INTERCORRELATIONS AMONG THE Four RatINGs 








Psychiatrist Psychologist, History 





66 Cases 125 Cases 66 Cases 125 Cases 66 Cases | 125 Cases 
Psychologist. 76 .88* 
History .64 eh" Ay f°) 
Rorschach .38 ng .33 


























*Estimated uncurtailed distribution (McNemar, p. 126) 


The Graded Stimulus Test Scores. If the analysis and scoring of the graded stimulus 
test was based on a priori theoretical grounds, rather than empirical item analysis, 
and then correlated with the ratings afterwards, whatever measure of relationship 
would appear more probably would exclude the possibility of a positive bias due to 
unique factors in this particular sample. Scoring on theoretical grounds would also 
give a degree of independent meaning to the tests not obtainable by the above purely 
empirical method of scoring. For these reasons, the approach to the test analysis was 
made independent of the ratings. 

As already explained, the stimuli used, except for three nonsense figures, were 
actual incomplete line drawings in three stages of ambiguity. The obvious first task 
was to tabulate the number of “correct”? and “‘incorrect’”’ responses. It should be 
expected over the entire sample that, despite the differences expected among the five 
ratings, there should be a definite and a sizable decrease in incorrect responses from 
Stage I, with greatest ambiguity, through to State III, with least ambiguity. Table 
2 illustrates that such was the case. For the nine meaningful figures in the series it 
can be seen there was a definite and marked decrease in other than expected responses 
as structuredness increased from stage to stage. (The three nonsense stimuli, with no 
correct response, were treated somewhat differently, as will be explained.) 

Even on purely theoretical grounds it would have been apparent that an in- 
correct response to a stimulus at Stage III should be weighted more heavily than an 
incorrect response to Stage I, when the stimulus is so much more ambiguous. It had 
been intended to weight the items, accordingly, with weights of 1, 2, and 3. How- 
ever, in the final weighting, acknowledgement was made to the empirical evidence by 
giving relatively greater weight to the more infrequently found incorrect responses 
in Stage III. The weights assigned were 1, 2, and 5, for each of the three stages 
respectively. 

The next area of significance was the series of three nonsense stimuli. On such 
perceptual tests as the Rorschach, rejections are considered important. This was a 
particularly appropriate approach in regard to the nonsense stimuli since they were 





ELI A. RUBINSTEIN 


TaBLe 2. “INcoRREcT’ REsPONSES BY ITEM AND BY STAGE 
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Animal 


Tree 
Profile 
Watch 
Butterfly 
Hand 


Totals | 834 | 
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specifically designed to elicit no definite response. Accordingly, the scoring on these 
items involved tabulations of the number of ‘Don’t Know” responses. From a 
theoretical standpoint it should be expected that with the least complex presentation 
of Stage I it should be easier to make a response than with the more complex, but not 
more meaningfully structured, presentation of Stage III. A weighting of 3, 2, and 
1 was adopted for responses to Stages I, II, and III, respectively. 

It might be mentioned that ‘‘Don’t Know” responses for the meaningful stimuli 
also occurred occasionally. In this case, since that was already a wrong response 
and weighted accordingly, the rejections here were given one extra point to take into 
account this special type of response. 

We had, thus, a quantitative measure of what might be called both perceptual 
distortions and rejections. In addition, the following five qualitative types of res- 
ponses, each given a weight of 5, were considered as being both among the most 
significant and, as another criterion for selection, the most easily and objectively 
identified : 

1. . Bizarre, contaminated, or symbolic responses. These were responses in which 

the content was flagrantly unreal, involved the combination of two incompat- 

ible items or utilized the content as symbolic of great personal significance. 

2. Sex responses. These were responses in which the drawing included the 

identification of some primary or secondary sexual characteristic. 

3. Morbid or horror responses. These were responses in which the content in- 

cluded some dysphoric feeling tone. 

4. Cut-off responses. These were responses which involved some violation of the 

objective limitations of the stimulus material and ignored certain parts of the 

figure presented, changed parts of the figure, or indicated that the drawing 
should continue outside of the card. 

5. Repitition. This scoring involved repetition of responses from one stage to 

another and from one item to another. This category was considered only for 

the nine nonsense figures. The reason was that for these items there was no 
meaningful development from stage to stage and therefore repetition was more 
correctly a perseveration than it was in the case of the other nine stimuli. 


Total scores were computed for each of the 125 cases and a product-moment 
correlation was obtained between these weighted scores and the sum of the four 
ratings for each subject. This correlation of .47, is given in Table 3, as are the cor- 
relations between these total scores and 1.Q. and between I. Q. and ratings. 
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TaBLE 3. INTERCORRELATIONS AMONG TEsT ScorgEs, 
RatTInGs, AND I. Q.’s 








Variables r or 
Rating and Test Score .55 .063 
Rating and I. Q. —.38 .077 
Test Score and I. Q. —.39 .076 

T12.8 47 .071 

















Using the split-half correlation as a measure of reliability and correcting for 
length by the Spearman-Brown formula, the reliability coefficient for the test scores 
was found to be .76. However, in correcting for attenuation the correction was 
made in the criterion only ‘: ?. 5°), The reliability for the ratings was applied to the 
first-order partial correlation given in Table 3 and the corrected correlation between 
the two measures was found to be .54. 

Further examination of the relationship between the two variables was made 
by grouping the test scores according to the five rating groups. (The total score of 
the four pooled ratings was divided by the number of ratings to give an average 
rating which was then rounded off within the original five-point grouping.) Table 4 
shows that, while there was considerable overlap of test scores for adjacent rating 
groups, there was a distinct trend toward differentiation of all five groups. 


TABLE 4, DIFFERENTIATION OF TEsT Scores BY RaTING 








Rating Mean 8. D. oM Range 
i. 20.00 6.69 1.46 8-37 
23 .64 6.83 1.49 12-38 
27.43 10.06 1.73 13-56 
32.69 12.94 2.54 15-61 
47.11 20.86 4.92 























DIscUSSION 


Sherif and Cantril, in their discussion of gradation of structure in the stimulus 
field @: PP. 4548) state that few studies have investigated this variable. This observa- 
tion is borne out in our own review of the literature. However, they believe that 
even present meager results point to the fact that increased structuredness produces 
decreased influence of internal factors in the percept. 

We have already noted that for the entire sample the proportion of incorrect 
responses to the meaningful stimuli decreases considerably from Stage I through 
Stage III (Table 2). It is these data which seem to verify the above hypothesis. When 
the organization of the stimulus material is such that its structure seems to compel 
immediate and unequivocal recognition, even severe internal pressures such as ac- 
company extreme personality maladjustments do not have a major influence on the 
perception. 

However, while it was expected that degree of structuration would have an im- 
portant influence on the perception, we expected to find the operation of another 
variable, degree of personality integration, to be influential within this framework. 
It would seem that the extent to which the perception takes on attributes of the 
structure of the stimulus increases more rapidly, with changes in structuration in the 
stimulus, in well-adjusted individuals than in severely disturbed individuals. 
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Bruner and Postman: P?. §8) speak about selection, organization, accentua- 
tion, and fixation, as four broad functions performed by the perceiver. Subjective 
factors may influence the perception in any one or all of these functions. Further- 
more, for each of these functions there are undoubtedly a number of different pro- 
cesses, in accord with specific personality dynamics, whereby the subjective factors 
may influence the perception. 

We emphasize this point in order to help explain the extensive overlapping 
which occurred among the test scores for the five rating groups. The analysis of the 
stimulus material, while sufficient to confirm our original hypothesis, was not cap- 
able of revealing the full relationship of our two variables. Our test material tapped 
only a limited number of ways in which the internal factors might operate in the 
perceptual process. This observation explains not only the overlapping in scores 
among the ratings but also the marked increase in variability of scores among the 
severely maladjusted subjects. As we go from normals to the severely maladjusted 
individuals it would seem that an increasing amount of the personal factor is intro- 
duced into the perceptual process. If this is accompanied not only by increased in- 
tensity in the operation of the personal factor but by an increasing number of differ- 
ent methods of introducing these personal factors then any technique which is sensi- 
tive to only a constant number of methods would become less and less specific to the 
total process as an increasing amount of the personal factor operates. Thus, for our 
Rating 1, where the personal factor, for all three stages, is least operative, there 
would be lowest scores no matter if only a portion (but of necessity a significant 
portion) of the methods was measured. On the other hand, as the intensity and 
diversity of subjective factors increased, some scores would be high—where the 
methods used were those tapped by the technique; and some scores would be decep- 
tively low— where the methods used were mainly outside of those measured by the 
technique. 


FURTHER RESULTS 


In an attempt at some cross validation of the results found here, the graded 
stimulus material was presented to 68 additional out-patients at a Veterans Admin- 
istration mental hygiene clinic. Again, these were white, male veterans with no 
known organic brain damage. The scores on the test were correlated with a single 
rating on the five-point rating scale. This rating was made by the psychologist doing 
the complete psychological examination. Two changes in the original procedure were 
made. Since this sample of 68 cases varied less in severity of illness than the original 
group, half step intervals were used in the rating to increase the number of groupings 
from three to six. The range of ratings went from 2.0 to 4.5 with a mean rating of 
3.1. The stimulus material was unchanged in content and scoring except that three 
of the series; the tree, the watch, and the profile were omitted from each stage. These 
three had been among the least discriminating of the original 12 items. Thus, there 
were 27 rather than 36 cards in all. The product-moment correlation was .52, which 
is quite close to the uncorrected correlation of .55 on the original 125 cases and would 
seem to argue that our a priori analysis and scoring did achieve some measure of 
relationship to the rating of personality integration. 


SUMMARY AND CONCLUSIONS 


This study attempts to investigate one aspect of the very complex relationship 
between personality and the perceptual process. Degree of personality integration 
was measured by using a five-step rating scale on 125 subjects obtained from varying 
sources. Reliability of these ratings, established by intercorrelation of three in- 
dependent ratings, was .75. 

Structuredness of stimulus was graded into three steps by taking a series of nine 
incomplete line drawings of simple objects and removing, in two successive steps, 
key details of the drawing. In addition to the nine drawings, three nonsense draw- 
ings were interspersed so as to prevent a mental set from developing in regard to the 
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increasing structuredness from stage to stage. Responses to the graded stimulus test 
were analyzed and scored according to three aspects: failure to see the ‘‘expected”’ 
response, rejections, and various qualitative indications. Total weighted scores on 
this test were then correlated with the Mean Ratings for the subjects and the cor- 
relation of .55 proved substantially significant. However, intelligence, measured by 
the Wechsler-Bellevue Scale, was also found to correlate significantly with both 
ratings and test scores and, therefore, had to be partialled out. The partial co- 
efficient between ratings and test scores was .47. 

The results did largely confirm our original hypotheses, and the following con- 
clusions seem warranted in the light of this study: 


1. All other things being equal, the role of internal factors in the perceptual 
process decreases with increased structuration of stimulus. 


2. All other things being equal, the role of internal factors in the perceptual 
process increases with decreased total personality integration. As a corollary of this 
it might be stated that individuals who are well-adjusted are quicker to restructure 
their percepts in accord with the stimulus as structuredness increases than are in- 
dividuals with severe maladjustments. 


The implication of these findings is that the relationship between the perceptual 


process and the level of personality integration, although complex, is distinct and 
substantial. 
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PROBLEM 


The process of rationalization as an ego defense mechanism has been universally 
recognized. Since Jones first used the term in 1908 there has been a great deal of 
descriptive literature about rationalization. However, there exists a paucity of ex- 
perimental or statistical studies centered about this self-justification concept. This 
is especially true with respect to the commonly heard rationalizations or explanations 
offered by subjects for not doing as well as they might have done on paper and pencil 
tests usually given in vocational guidance. 

Because of its frequent occurrence in everyday life, an analysis of rationaliza- 
tion has implications for the psychotherapist, teacher, parent, and all personnel 
workers who are interested in the ego-defenses of the people they deal with. 
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The purpose of this study was (1) to categorize the reasons offered to explain 
test behavior and determine their frequency; (2) to compare the categories of reasons 
with each other with reference to objective test scores of temperament and the 
abilities. 

METHOD 

Preliminary survey. In order to gather as many rationalizations as possible, a 
random portion of a senior high school class was canvassed to obtain the one most 
important reason why they did not do themselves justice on any or all tests they had 
taken in the past. No tests were administered previous to this request. They res- 
ponded to a prepared statement reading verbatim as follows: 


“We often hear people say that they could have done better in tests than 
they have done. We would like for you to write the one most important reason 
why you think you could have done better than you have.” 


The rationalizations or reasons thus gathered were given to a committee of three 
psychologists who were instructed to label each reason with a word, a phrase or 
sentence that would in their opinion characterize the statement. The judges worked 
independently and labeled each statement in any fashion they chose. 


Inspection of the judges’ statements which overlapped and showed agreement 
revealed the following clusters of reasons: 


Time or speed factors; 

Nervousness or anxiety; 

Carelessness or poor work habits; 

Self-depreciation or inferiority feelings; 

Poor motivation or lack of concern; 

Did best or no excuse; 

Body aches or physical discomfort; 

Tests were inadequate to measure what they were supposed to; 
Never got started or misunderstood directions; 

10. Miscellaneous. 


» 
2. 
3. 
4. 
6. 
ri 
8. 
9. 


Since the great majority of the reasons for test behavior secured were direct and 
clear as to meaning, it was decided by the committee of judges that only those state- 
ments which had unanimous agreement would be acceptable for categorization. Of 
the 137 statements in the preliminary survey, six were eliminated because of being 
illegible, blank, or semantically unclear, leaving 131 statements on which there was 
perfect agreement. 

From these a set of directions for selecting categories of rationalization was 


developed, and this became the basis for subsequent categorization. These are as 
follows: 


DIRECTIONS FOR SELECTING CATEGORIES OF RATIONALIZATION 


Each card containing a rationalization will be numbered. When you have 
chosen a category, say Rationalization 1, Rationalization 2, etc., for each card, 
first recording the card number and then the Rationalization number on the 
sheet provided. 

Read the following examples for each category before selecting 

Rationalization 1: When a doubt is expressed as to time, as “test is too short’, 
“worked too slowly’’, etc. 

Rationalization 2: When tenseness or emotion is expressed, as “I was tense’, 
“T was nervous’, etc.; expressions of instability, moodiness, being bothered 
by noise, etc. 

Rationalization 3: When careless work habits, poor study methods, or working 
on the wrong area of the answer sheet is expressed, etc. 

Rationalization 4: When self-depreciation is expressed, as ‘“Never do well on 
tests’, or ‘I shouldn’t be taking tests’, etc. 

Rationalization 5: When poor motivation is expressed, as “don’t care’, “it 
makes no difference’’, ete. 

Rationalization 6: When no excuse, or “I think I’ve done my best”, is expressed. 
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Rationalization 7: When bodily aches or pains are mentioned: headache, cold, 
complaints about eyes, glasses, light, hearing, weather, etc. 

Rationalization 8: When the test is considered unfair or inadequate for measur- 
ing the individual’s particular interests, abilities, etc. 

Rationalization 9: When instructions are misunderstood and the individual did 
not know what to do. 

Rationalization 10: When none of the above rationalizations is expressed. 


Following the delineation of the rationalization categories to be used in this 
study, specific statements from students justifying their test behavior after certain 
standardized tests of temperament and mental abilities had been administered were 
secured. The standardized tests used were the PMA, the Guilford-Martin GAMIN- 
STDCR factors, the Memory Test of the Factored Aptitude Series developed by 
King, and the Perceptual Speed test of the Guilford-Zimmerman Aptitude Survey. 


Subjects. Five hundred sixty-one high school seniors comprised the sample 
utilized in the principal portion of the investigation. Data were obtained on seven 
hundred twenty-four students, but one hundred sixty-three cases were eliminated 
because administrative rules prevented the use of the test scores of one of the schools. 
The rationalizations of these one hundred sixty-three students were kept, however, 
for frequency comparisons. 


Procedure. After the standardized tests had been administered, the prepared 
request to justify their test behavior was read to them as used in the preliminary 
survey. The explanations or rationalizations were then categorized by the use of the 
Directions for Raters on Selecting Categories of Rationalization. The same three 
judges participated in the categorizing of those statements as were utilized in the 
preliminary survey. 


RESULTS 


Of the 724 statements so categorized, there was not a single case of rationaliza- 
tion 4 or “self-depreciation”’, rationalization 8 or “test unfair’’, or rationalization 9 
or “instructions misunderstood”. To nullify ambiguity, rationalization 10 or ‘‘mis- 
cellaneous” was eliminated from the investigation. This left the following six ration- 
alization categories: (1) time, (2) nervousness, (3) carelessness, (5) poor motivation, 
(6) did best or no excuse, and (7) body aches and pains. 

The procedure described above was followed in three separate schools of similar 
population in order to check the consistency of the frequency of the various categor- 
ies of test behavior explanation or rationalizations. It can be seen from Table 1 
that the percentage of rationalizations secured in the three high schools shows ex- 


TaBLeE 1. PerrcentaGes oF Types oF RaTIONALIZATIONS SECURED FROM THREE HicH ScHoou 
Srupent PopvuLaTiIons 








School A* School B School C 





Rationalization 
Category Per cent N Per cent Per cent 


Time 49 y 41 48 
Nervousness 18 74 23 17 
Carelessness 16 58 18 19 
Poor Motivation 2 1 4 
Did Best 8 5 18 
Body Aches 17 11 38 


Total 724 163 100 324 





























*This school was used only to check frequency of responses. 
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tremely high consistency. ““Not enough time” was given as a rationalization to the 
extent of about forty-five per cent of the total reasons. “Nervousness” as a justifi- 
cation was presented to the extent of approximately nineteen per cent of the total 
rationalizations secured. Approximately eighteen per cent of the total reasons 
secured were categorized as ‘‘carelessness’’. Only one per cent of the student popula- 
tion gave a rationalization of “poor motivation’. Those who claimed they had 
“done their best”’ made up about four per cent of the total. Finally, those who used 
“body aches and pains” as a reason to justify their test behavior made up eleven per 
cent of the total sample. 

The two high school senior classes used in this study were compared as to the 
frequency of rationalizations expressed by the two sexes. As Table 2 indicates, fifty- 
seven per cent of the total populations were females and forty-three per cent were 
males. The ‘‘time” group showed approximately the same percentage distribution 
as the total distribution (critical ratio of percentages .80). The ‘“‘nervousness’’ group 
showed somewhat less similarity, but yet were not significantly different (critical 
ratio of percentages .99). The distribution showed that the males used ‘‘careless- 
ness” as a rationalization significantly more often than the females based upon the 
expected expectancy total distribution (critical ratio of percentages 1.89 or ap- 
proaching significance at the five per cent level). The male “poor motivation” group 


TABLE 2. PERCENTAGES OF TyPEs OF RATIONALIZATIONS BY SEX From Two Hiau ScHoots 








Male Female Total 
Rationalization 


Category ‘i Per cent N Per cent Per cent 











Time 40 100 
Nervousness 38 71 ‘ 100 
Carelessness i 53 50 100 
Poor Motivation 56 4 100 
Did Best ‘ 50 12 100 
Body Aches 43 37 100 


























Total 242 43 | 319 100 





did not occur significantly more often then the “poor motivation’ explanation for 
the females with reference to the total distribution (critical ratio of percentages 
.78). The male “did best” group did not occur significantly more often than the 
female with reference to the total distribution (critical ratio of percentages .67). The 
“body aches” group coincided in percentage distribution with the total distribution 
of boys and girls (critical ratio of percentages .00). 


ComPARISON WitH OTHER TEst RESULTS 


The significant differences between each of the rationalization groups and all 
the other rationalization groups combined on the temperament and mental abilities 
factors were determined in order to yield a particular pattern (or lack of pattern) 
for each category. 


The “‘time’’ group. This group of students, who complained about the pressure 
of time while taking tests, showed themselves to be significantly less active in their 
general overt behavior than all the other rationalization groups combined. The 
“time” group rated itself as being significantly less stable and more introverted in its 
thinking than all the groups combined. In other words, the last stable students used 
“time” as a reason for not doing well on examinations. Moreover, the ‘“‘time’’ group 
was significantly inferior on its earned scores on the verbal, spatial, reasoning and 
word-fluency factors than all the other groups combined. In general, the less stable, 
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the less active, the more meditative and the poorer students in the mental abilities 


offer pressure of time as an excuse for not doing as well as they could have done when 
taking tests. 


The “‘nervousness’”’ group. The group who gave “nervousness” as a reason for 
poor test behavior show no homogeneity when compared to all the other groups 
combined. No distinct patterns are discernable from the data. Students who gave 
“nervousness”’ as a rationalization for not doing well when taking tests indicated no 
significant characteristics on any of the seventeen factors utilized in this study. 


The “‘carelessness” group. The group who gave “carelessness” as a reason for 
inadequacy in taking tests indicate they are more ‘‘masculine”’ in attitude and inter- 
est than all other rationalization groups combined. The data suggest that those 
students who are more “feminine” than others in attitudes, tend not to rationalize 
test behavior by offering ‘‘carelessness’”’ as an excuse. Since, as Table 2 indicates, 
significantly more males than female students manifested this rationalization, super- 
iority in space perception for this category may be expected in view of the fact that 
in our culture boys are generally conceded to be “better” in spatial conceptions than 
girls. This ‘“‘carelessness” group is superior to all the others in number ability and in 
word-fluency. In general, the group that uses the rationalization ‘carelessness’ 
rates itself more masculine and scores higher on the space, number and word-fluency 
factors than all the other groups combined. 


The ‘‘poor motivation” group. This group, which was small in total percentage 
(see Table 2), nevertheless showed certain definite differences from all the other 
rationalization groups combined. Despite the group’s “‘don’t care” or “it makes no 
difference’ approach, the group rated itself as being introvertive in thought and 
decidedly lacking in fluency with words in comparison with all the other groups 
combined. Furthermore, this group was less able than the others to discern details 
speedily. The rationalization stated may reflect poorer communication ability on 
the part of this group than all the others combined. The data suggesting less “‘care- 


TaBLe 3. Critica, Ratios BETWEEN EacuH RATIONALIZATION GROUP AND ALL OF THE OTHER 
RATIONALIZATION Groups COMBINED ON THE VARIOUS TEsT Factors 








Test Factor Critical Ratios 
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lessness” in thought may explain the lack of speed in discerning perceptual details. 
In general, this group is introvertive in thought and inadequate in word-fluency 
and perceptual speed. 

The ‘‘did best’”’ group. This group is apparently heterogeneous in composition. 
Students using this rationalization showed no distinction of statistical significance 
from all the other rationalization groups combined. 

The “body aches” group. This group shows more temperamental differences than 
any of the other groups. The “complainer” of various body aches and pains indicates 
poorer sociability than all of the other rationalization groups combined. The mem- 
bers of this group are more emotionally stable and show greater aggressiveness than 
those in the other categories. Moreover, they are decidedly less introvertive in 
thought and more active overtly than the others. As far as the mental abilities are 
concerned, the members of this group are superior in word-meaning or verbal ability. 
The data suggest that, because of their superior activity rate, aggressiveness, ‘‘care- 
freeness”’ of thought, and verbal facility, members of this group may comprise many 
of the individuals who are “troublesome’”’ to test administrators. 


CONCLUSIONS AND SUMMARY 


The results of the present study suggest that explanations or rationalizations of 
test behavior are amenable to categorization and statistical manipulation. Further- 
more, a high degree of consistency exists in terms of the frequency of percentages of 
the kinds of rationalizations offered by students of the schools studied in this research. 

The only sex difference noted in frequency of rationalizations used was “‘care- 
lessness’’, which was used significantly more often by boys than girls. This may have 
a cultural basis. 

The ‘‘time’’ group was less stable, less active, more introvertive in thought and 
poorer in the verbal, spatial, reasoning and word-fluency factors of mental ability 
than all the other categories combined. 

The ‘‘nervousness”’ group showed no distinct pattern when compared to all the 
other groups combined. 

The “carelessness” group are more masculine and superior in spatial, numerical 
and word-fluency factors of mental ability. 

The “poor motivation” group showed poorer word-fluency and perceptual 
speed and is more introvertive in thought despite its “don’t care’’ approach. 

The “‘did best or no excuse” group indicated no significant differences from all 
the other groups combined. 

The “body aches” group showed poorer sociability, greater emotional stability, 
greater aggressiveness, less introversion in thought, a higher activity rate plus better 
verbal ability than all the other groups combined. 
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A QUANTITATIVE STUDY OF THE HTP AND ITS RELATIONSHIP TO 
THE WECHSLER-BELLEVUE SCALE": ? 


HAROLD RUBIN 
Veterans Administration Hospital, Lebanon, Pa. 


INTRODUCTION 


During the past two years, this writer has used the House-Tree-Person Test 
(HTP) devised by Buck as part of a routine admission test battery in a veterans 
hospital. Although devised as a projective device aimed at appraising total personal- 
ity structure, the HTP does have a carefully designed quantitative procedure which 
yields a number of IQ scores. While in one section of his manual Buck states that the 
HTP is not intended to be a test of “intelligence” and that no high correlations 
should be expected between it and other specific measures such as the Wechsler- 
Bellevue or Stanford-Binet, he claims in a later section that the test does measure 
“general intelligence’. An apparent inconsistency exists. Furthermore, Buck pre- 
sents tables showing rather high correlations between the HTP and W-B scores. 
This writer is of the opinion that Buck may be underestimating the possible effective- 
ness of the HTP as a measure of intellectual function in the interest of furthering 
its value as a projective device measuring total personality structure. 

Estimating intellectual level from drawings is certainly no new approach, but it 
became a more widely utilized method with the advent of the Goodenough Draw-a- 
Man-Test in 1926“). The method was, for many years, confined largely to drawings 
of children and it was only recently that this approach became more widely used at 
the higher age levels. The HTP is one of these more recent approaches and it is the 
aim of the present investigation to explore its value as a measure of adult intelligence. 

The only report this author could find in the literature evaluating the HTP asa 
measure of intellectual function is that of Sloan and Guertin “ in a study on mental 
defectives. These authors fail to find evidence for support of the hypothesis that the 
HTP is a measure of intelligence using the W-B as the criterion. These findings, 
however, represent a very limited population. The present study attempts to eval- 
uate this hypothesis with a wider range of subjects and with the additional aim of 
establishing its validity in diagnostic screening. For this purpose, the Wechsler- 
Bellevue Scale was selected as the most widely accepted measure of that function 
which we call ‘“‘intelligence.”’ 


PROCEDURE 


Subjects in the present study initially consisted of 135 patients admitted to the 
N. P. Service of a veterans hospital. Although specific diagnosis was not a criterion 
for selection, many of the subjects were psychotic. All testing was done within ap- 
proximately one month following admission to the hospital and included the HTP 
and Wechsler-Bellevue tests among others. Of this original group of subjects, twenty- 
seven were eliminated from the final group because their HT P’s were regarded as non- 
scorable for this study. They either failed to meet the minimal requirements for 
scoring as outlined by Buck? or they produced incomplete records by omitting one 
of the three drawings. The Wechsler-Bellevue tests for these subjects, however, 
were scorable. Although this procedure may be regarded as somewhat selective, it 
was felt that the nature of the investigation supported this approach. The remaining 
108 records were then utilized in this study. The subjects ranged in age from 18-50 
years with a mean of 30.2 and in educational level from four years to college graduate 
with a mean of ten years. Standard administrative and scoring procedures were 
followed at all times as outlined by Buck”. 


1Read in part at the meetings of the American Psychol ey Association, Cleveland, O, 1953. 
*From the Veterans Administration Hospital, Lebanon, 
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RESULTS AND DISCUSSION 


In Table 1 are presented the mean scores, standard deviations and ranges for the 
HTP and W. B. I1Q’s. It can be readily noted that both the % Raw G IQ and Net 
Weighted IQ scores of the HTP tend to be somewhat lower than either of the three 
WB IQ scores and also that the former have a slightly wider dispersion. An F test 
of the five mean scores failed to yield a significant difference. 


Taste 1. Means, StanpDARD DEVIATIONS an _ or HTP anp WEcHSLER-BELLEVUE IQ’s 
N=108 








Test Mean 8. D. Range 
HTP % Raw G IQ 95. 21.2 36 — 140 
HTP Net Wtd, IQ 92. 19.7 35 — 134 
WBFS IQ ; 16.5 55 — 127 
WBV IQ 100.29 17.6 56 — 129 
WEP IQ 102.56 14.9 62 — 133 

















TABLE 2. PEARSONIAN r’s BeEtwEEN HTP % Raw G, HTP Net Weiaurep IQ anp WECHSLER- 
BELLEVUE IQ’s (N =108) 








Wechsler- HTP Confidence HTP Net Confidence 
Bellevue IQ’s | % Raw G limits (P=.99) Weighted IQ limits (P =.99) 


Verbal .623 .44- .75 . .42 — .74 
.634 .46 — .76 : .39 — .72 





Performance 


Full Scale .674 .51 — .79 i .46 — .76 





| 
| 





Table 2 presents the correlations between the W. B. IQ scores and the HTP 
% Raw G 1Q scores. The confidence limits listed were established by means of the r 
to z transformation described in McNemar™. As can be observed, the % Raw G IQ 
correlates highest with the W. B. Full Scale IQ (r = .674), next highest with the 
W.B. Performance Scale (r = .634) and lowest with the W. B. Verbal Scale (r zamg* 
.623). The difference between these three sets of correlations are actually very small. 
These correlations, although not quite as high, compare favorably with those re- 
ported by Buck? on his standardization group. Buck found correlations between 


% Raw G I1Q’s and WB IQ’s to be .746 for the Full Scale, .724 for the Performance 
Scale and .699 for the Verbal Scale. 

The Net Weighted IQ was treated in a similar manner and the results are given in 
Table 2. Correlations between the WB IQ’s and the Net Weighted IQ were found to 
be .636 for the WB Full Scale, .604 for the WB Verbal Scale and .580 for the WB 
Performance Scale. These correlations are somewhat lower than those reported for 
% Raw G 1Q and hence of lesser value in predicting W. B. scores. 

These results would seem to indicate that the HTP does compare favorably with 
the Wechsler as a measure of intelligence thus contradicting the conclusions of Sloan 
and Guertin“. Some validity, then, may be attributed to Buck’s contention that 
the HTP is a measure of intelligence, at least to the extent that it is measured by the 
Wechsler-Bellevue Scale. These findings indicate that in a screening situation one 
may obtain a fair estimate of a subject’s level of intellectual function from the HTP. 
A more thorough evaluation, of course, would be obtained through administration 
of the Wechsler-Bellevue. 

To evaluate the extent to which the age correction factor on the Wechsler 
affects the relationship between W. B. and HTP IQ scores, the latter having no such 
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“correction’’, a correlation was computed between the Raw G IQ and Full Scale 
Weighted Scores. An r of .534 was obtained which is lower than that obtained be- 
tween Raw G and W. B. IQ scores (r = .674). This would appear to contra-indicate 
Buck’s presumption that the relationship between the HTP and Wechsler-Bellevue 
IQ’s would be depressed because of the age correction factor on the Wechsler. It 
does suggest, however, that the HTP score is not affected by advanced age to any 
appreciable degree and as such, adds to the stability of this measure. A comment 
might be made at this point that the HTP Raw G IQ bears a higher relationship to 
the WB Full Scale Weighted Score than at least three of the WB subtests: Digit 
Span, Picture Arrangement and Object Assembly. 

To further evaluate the quantitative scores of the HTP, all cases in which there 
was a disparity of 15 points or more between HTP and Wechsler IQ’s were explored 
to determine whether this difference was in some way related to intellectual level as 
established by the W-B. Fifty-three cases were found to reveal such a discrepancy. 
Of these, as observed in Table 3, sixteen were in the upper intellectual range, twenty- 
two in the middle range and fifteen in the lower range. This distribution tends to 
approximate the distribution of all subjects at these three intellectual ranges. It may 
be observed that in large part the differences were in the direction of lower HTP 
scores. Where HTP scores were higher than the Wechsler, they appeared in greater 
frequency at the lower intellectual levels. Sloan and Guertin “® report a somewhat 
similar finding with mental defectives. 


TABLE 3. FREQUENCY oF CasEs wITH 15 oR More Pornts Disparity BETWEEN 
HTP anv WEcHSLER-BELLEVUE [Q’s aT THREE INTELLECTUAL LEVELS 








Wechsler- HTP HTP 
Bellevue Higher Lower Total 


115 ond above 3 13 16 (30%) 
91-114 3 19 22 (42%) 
90 and below 5 10 15 (28%) 














A further analysis of the data in Table 3 revealed that in almost all instances 
the % Raw G IQ more closely approximated the W. B. Full Scale score than did the 
Net Weighted 1Q, irrespective of intellectual level. At the lower levels in particular, 
the weighted scores appeared to penalize the subjects heavily. This finding is in dis- 
agreement with Buck’s contention that in subjects of above average or superior abil- 
ity the Net Weighted IQ more closely approximates the subject’s level of intellectual 
function. It should be added, however, that there is one exception to this. In those 
records in which no “flaw” scores appear, the percentage method of arriving at the 
% Raw G 1Q yields a spuriously high score and in such instances, the Net Weighted 
IQ is a more accurate estimate of intellectual function. However, as Buck has in- 
dicated, the frequency of this occurrence is rare, with only two such records appear- 
ing in this study. 

More recently, Buck“ has attempted to ) develop score patterns indicating mal- 
adjustment based on what he terms ‘‘the quality of the quantity.”” One such pattern 
involves a comparison between Raw G and Net Weighted IQ’s. According to Buck, 
“The Raw GIQ... seems largely to represent the quantity of a subject’s creativity 
and productivity; to emphasize how much he does. The Net Weighted IQ. . . seems 
to emphasize the quality of the subject’s concepts; to stress how well the subject does 
something.” A discrepancy of fifteen or more points in favor of the Net Weighted 
IQ is regarded as indicating that the subject is functioning below the qualitative 
level of which he is capable. On the other hand, if the “% Raw G IQ is 15 or more 
points above the Net Weighted IQ, ‘‘pathoformicity is definitely to be suspected, and 
with the quality of the behavior that much below quantity, regression is implied. . ” 
Comparisons between the Raw G and Net Weighted IQ’s of the 108 subjects studied 
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revealed that only 7 had differences of fifteen or more points, all in the direction of 
the Raw G score. Differences of ten or more points were found in nineteen patients 
with only 2 in the direction of the Net Weighted IQ. The mean difference between 
Raw G and Net Weighted IQ’s was 6.8. Since the subjects in this study were all 
psychiatric patients, it would be expected that if this pattern were valid more than - 
seven patients should have been identified by this discrepancy. Thus, these findings 
question the validity of such a pattern analysis with the HTP. 


SUMMARY 


The HTP was evaluated with respect to its effectiveness as a measure of in- 
tellectual function. Subjects were 108 patients admitted to the N. P. Service of a 
veterans hospital. With the Wechsler-Bellevue as the criterion, results revealed the 
HTP to be a valid measure of “intelligence.” The % Raw G IQ had the greatest value 
in this respect. Buck’s score pattern proposed to indicate maladjustment did not 
identify this group of psychiatric patients as “‘maladjusted.” 
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THE USE OF CARTOONS AS A PROJECTIVE DEVICE 


GEORGE B. STROTHER, MARGARET MCCRACKIN BARNETT 
AND PETER C. APOSTOLAKOS 


University of Minnesota, Duluth Branch 


INTRODUCTION 


Many psychologists accept Freud’s theory that comic situations owe much of 
their effect to the fact that they release repressed reactions in a socially acceptable 
way. The authors were impressed with the extent to which this seemed to be true of 
the series of cartoons by William Steig published under the title The Lonely Ones. © 
The subsequent appearance of these cartoons on cocktail napkins, ashtrays and 
cocktail glasses may, through wider dissemination, have reduced their potential 
value as a clinical instrument but it has at the same time assured their use under con- 
ditions whereby maximum catharsis might be obtained. Since the experimental 
trials for the present study antedated the use of these cartoons as an adjunct to the 


cocktail hour we need not be concerned with the effect thereof as far as this study 
goes. 


PURPOSE 


_ The purpose of this study was to determine whether such cartoons as Steig’s 
might be used as the basis for a projective technique. As a means of determining the 
subject’s reactions to the cartoons, several possibilities were considered. Finally a 


preferential sorting was chosen as the simplest means of determining individual 
reactions. 
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METHODS 


Preliminary studies were done with 60 subjects, all college students, comparing 
two different sorting procedures. Both procedures involved having the subjects sort 
the cartoons into five categories. The first procedure involved sorting them into 
“‘like-dislike” categories and the second into “funny-not funny” categories. The 
cartoons had been removed from the book and mounted on stiff cardboard with the 
captions below. Each was numbered, the numbers corresponding to their serial order 
in the book. These first runs were all individually administered. The question was 
also raised whether the order of presentation might have some effect and runs were 
tried in a 1 to 46 order and in a 46 to 1 order. No difference was found in the ten- 
dency of subjects to use any one category systematically as a function of the order of 
presentation. 

The cartoons were also presented to a group of 101 subjects by means of an 
opaque projector and responses were recorded by the subjects on an answer sheet. 
Subjects were instructed not to laugh at or comment on the cartoons so that other 
members of the group would not be influenced by their reactions. As a result of these 
preliminary runs, it was decided to use group presentation and to use the “funny-not 
funny” dimension. This was selected because it appeared to give subjects a feeling 
of making objective judgments whereas the preferential sorting by likes and dis- 
likes was construed as a more personal matter. The five categories employed were as 
follows: (1) very funny, (2) funny, (3) indifferent, (4) not funny, and (5) disgusting. 
All subjects seemed to accept these categories and to use them as points on a scale. 
Responses were recorded by each subject on an answer sheet with the five choices in 
a row following the item numbers. Subjects were instructed to look at each cartoon 
and classify it into one of the five possible categories. Thirty seconds proved to be 
sufficient exposure for each cartoon. 

The subjects used in this part of the experiment were 101 male and female col- 
lege students enrolled in an introductory psychology course. This group of students 
had taken several tests and several of these were selected as possible criteria. These 
were (1) nine scales of the Minnesota Multiphasic Personality Inventory, (2) ratings 
of tendency toward psychosomatic illness based on evaluation of student health 
records, (3) the Schneider “index of autonomic efficiency” “>, and (4) scores of the 
“Reporting Test’: *), The psychosomatic ratings were based on a five point scale: 
a score of one indicated a health record with no indication of psychosomatic illness, 
a score of five indicated the presence of a clearcut chronic complaint such as recur- 
rent bronchial asthma or several less severe but recurrent disturbances such as hives, 
hay fever and digestive complaints. The “Reporting Test’ is a group frustration 
test described in detail in the articles previously referenced. High scores on this test 
indicate minimal disruption of test performance following frustration. 

MMPI scores were available on all the 101 subjects. Psychosomatic ratings, re- 
porting test scores and Schneider indices were available on 90, 52 and 58 subjects 
respectively. 

The procedure for determining the relationship of cartoon reactions to each of 
the criteria was to compare the top 30% of the subjects with the bottom 30% for 
each of the criteria except the reporting test and Schneider groups wherein compari- 
sons were made between the high and low halves of the groups. These high and low 
criterion groups were compared with respect to their distribution in each of the five 
choice categories; chi-square tests were applied to test the extent to which each 
of these pairs of distributions for each cartoon failed to meet the hypothesis of homo- 
geneity. 

RESULTS 


Table 1 lists each of the most significant cartoons for each of the criteria to- 
gether with their probabilities of being true differences. Although it is not feasible 
to present the actual cartoons, reference to the captions will reveal that the cartoons 


showing maximum differences seem generally to be consonant with the supposed 
significance of the various criteria. 
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TABLE 1. List or THE Most SIGNIFICANT CARTOONS FOR EACH OF THE CRITERIA 








Criterion Cartoon No. Caption 





Hs 22 I can laugh at myself 
38 Why pick on me 
D 14 Who are all those others 
Hy I do what is expected of me 
Mother loved me but she died 
Let the leader of the cult do the worrying 
Public opinion no longer worries me 
You've ruined my life 
Revenge is sweet 
I do not forget to be angry 
(see above) 
But I can mystify and terrify 
Pt (see above) 
Se (see above) 
Ma My true love will come some day 
(see above) 
I have my own troubles 
Schneider index I think before I act 
Convention be damned 
You'll have to take me as I am 
I can’t express it 
Reporting test I am blameless 
(see above) 
Psychosomatic rating (see above) 
33 I recreated myself 
Pd (see above) 
(see above) 





The fact that the probabilities in Table 1 do not often reach a high level of sig- 
nificance is not surprising. Single items of this type rarely would with samples of 
this size and with criteria whose validities are themselves problemmatical. Inspec- 
tion of those items which do approach or reach a satisfactory level of significance sug- 
gests considerable face validity, particularly in view of the lack of any connection 
between the variables which might give a spurious relationship (such as intercorrela- 
tions between two inventories containing a number of similar or identical items). 
For example, the cartoons which differentiated subjects with high paranoid scores on 
the Multiphasice with the possible exception of number 35 are ones which most clin- 
icians would be apt to rate as loaded with a paranoid connotation. The captions are 
self-explanatory in the cases of numbers 17 and 25. Number 46 shows a woman with 
very large eyes half concealed behind a screen on which a large eye is drawn. 

Number 18, for another example not explained by the caption, rates high on the 
manic scale; it is a picture of an endomorphic man beating a large base drum. 

In spite of the encouragement that the foregoing results gave the experimenters, 
it could not be said that the results thus far were by any means conclusive. The next 
step in the verification of the original hypothesis that certain types of cartoons might 
lend themselves to use as a projective device was to construct an experimental 
“test” based on the foregoing data. This was done in the following way. For each of 
the criteria a per cent difference between the high and low groups was computed for 
each of the five sorting categories. If the excess per cent favored the high group 
the sign of the difference was plus; if it favored the low group, minus. All differences 
were divided by 10 and rounded off to the nearest whole number. Thus a per cent 
difference of 4 received a value of zero, a difference of plus 27 a value of plus 3, etc. 
The values thus determined were considered as scoring weights. 

The cartoons were then presented in group form to a new group of 31 subjects, 
also students in an introductory psychology course and their ‘‘scores” on each of the 
variables were computed. Table 2 gives the odd-even reliabilities corrected by the 
Spearman-Brown formula for each of the ‘“‘scales”. These reliabilities indicate that 
the subjects’ reactions to the cartoons do tend to form stable clusters which in some 
cases approximate satisfactory test reliabilities. 





THE USE OF CARTOONS AS A PROJECTIVE DEVICE 


TasBLe 2. Opp-Even RELIABILITIES FOR THE EXPERIMENTAL CARTOON ScALEs! 


Hs .67 Sc .79 
Hy 82 Ma .73 
Pd .80 D .56 
29 
52 








Mf Schneider .43 
Pa Reporting test 41 
Pt .40 Psychosomatic rating 





1Corrected by the Spearman-Brown formula 


An attempt was also made to determine whether these measures were reason- 
ably independent of one another. Table 3 gives the intercorrelations between the 
various cartoon “‘scales’’. In some cases the intercorrelations are sufficiently high in 
view of the reliabilities to indicate that certain of the scales actually measure much 
the same thing. On the other hand some of the intercorrelations are low enough to 
indicate that several different factors are operating. Table 4, which is a table of inter- 
correlations between some scales of the Multiphasic for the original group of 101 
subjects, indicates that the intercorrelations in Table 3 reflect to a large extent a lack 


of independence between the scales of the Multiphasic itself and do not necessarily 
inhere in the cartoons. 


TABLE 3. INTERCORRELATIONS BETWEEN THE CARTOON “‘ScaLzEs’’. (N 29) 








Factors Hs Hy £=Pd Pa Se Ma Mf Pt D_ Sch 





Hs + eee a. ~08 -3 .55 .49 -.01 
Hy ‘ See .07 -.57 .59 .48 .26 
Pd <i 04 -.13 .55 .49 .14 
Pa ; -.20 -33 .59 .15~ .10 
Sc 01 -.32. . 19 .20 
Ma ye ee 04 8.25 
Mf .53 
Pt -26 = =.43 


D 37 
Sch 


R. T. 
Pm 








TaBLe 4. TABLE OF INTERCORRELATIONS OF SELECTED SCALES 
OF THE MINNESOTA Muttipnasic (N 101) 








Factors Hy Pd Pa Se Ma 





Hs 73 .70 .63 .74 -.08 
84 


Hy — ae 
Pd 76 «6.83 = .04 


Pa .69 -.20 
Se .O1 


Ma 











42 G. B. STROTHER, MARGARET MC CRACKIN BARNETT, P. C. APOSTOLAKOS 


As a further check on the consistency of the cartoons in predicting the criteria 
a small sample of 15 subjects was obtained on whom Minnesota Multiphasie scores 
were available. Rank difference correlations were computed on the cartoon scales vs. 
the corresponding sections of the Minnesota Multiphasic. Table 5 gives the values 


TABLE 5. INTERCORRELATIONS BETWEEN CARTOON SCALES AND 
CoRrRESPONDING MMPI Scares (N 15) 








D Hy Pd Mf Pa Pt Se Ma 
50 .43 .78 .61 .66 .46 .70 .81 





obtained. These coefficients seem to indicate that the cartoon scales predict the 
criteria with a fairly high degree of accuracy. It seems reasonable to expect that 
were the other three criteria available on this small group similar correlations could 
be anticipated judging from the reliabilities obtained. 

Blind analyses of cartoon scores have subsequently been undertaken with a 
small number of psychiatric patients and writeups based on these blind analyses 
have been matched with patients by a psychologist and psychiatrist who were fam- 
iliar with the patients. The number of cases has been too small to reach a significant 
level of probability but appear to exceed chance expectancy. 


DIscussION AND SUMMARY 


The purpose of this study was to investigate the hypothesis that cartoons would 
prove useful as the basis for a projective technique. The cartoons selected for in- 
vestigation were William Steig’s The Lonely Ones, which appeared especially prom- 
ising. It was found that certain cartoons differentiated between high— and low— 
scoring groups of subjects determined by the Minnesota Multiphasic, a rating of 


tendency toward psychosomatic illness, an “index of autonomic efficiency,” and a 
test of response to a frustrating situation. 

An experimental ‘‘test’’ was constructed based on these data. Per cent differ- 
ences between the high and low groups of subjects on each criterion were used as the 
basis for determining scoring weights. The cartoons were then presented to a new 
group of subjects and reliabilities of the cartoon “‘scales’’ were computed. The re- 
sults indicate that the subjects’ responses to the cartoons tend to form internally 
consistent clusters which are in some cases more or less independent of one another. 
The interdependence found in other instances appears to reflect in part at least the 
interdependence of the criteria and does not necessarily reflect anything inherent 
in the cartoons. 

The criteria selected have no particular virtue. They were chosen chiefly be- 
cause of their availability and are justified solely on the basis of the original assump- 
tion that if such criteria showed positive results the original hypothesis would be 
verified. Other clinically or experimentally better criteria would almost certainly 
yield better results. 


Item analysis and other refinements would doubtless improve the results ob- 
tained in the foregoing. 


It is concluded that objectively scored judgments of cartoons have potential 
value as a projective technique. 
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A PRELIMINARY REPORT ON A TECHNIQUE DESIGNED TO 
DIFFERENTIATE PATIENTS WITH CEREBRAL PATHOLOGY AND 
PSYCHONEUROSIS* 


ALEXANDER TOLOR 
Neurological Institute, Presbyterian Hospital, N. Y. C. 


PROBLEM 


Psychometric techniques that are able satisfactorily to differentiate between 
patients with cerebral abnormality and psychoneurosis are of great value particularly 
because of the similarity of symptomatology in these two categories. Also, the 
possibility of early detection in patients of a cerebral involvement before obvious 
clinical manifestations develop, and the possibility of altering the course of the 
disease by instituting early therapeutic measures, suggests that this area is an im- 
portant one for further investigation. Of the numerous methods used by psycholo- 
gists for neuro-psychiatric diagnosis, such as the analysis of performance on sub- 
tests of the Wechsler-Bellevue scale, the Rorschach inkblot test, projective drawings, 
and Bender-Gestalt Test, none is without serious limitations and deficiencies. The 
present report describes a very brief and economical instrument that may be easily 
adapted as one test in a psychological battery, in conjunction with other laboratory 
techniques, to assist in a differential diagnosis. The purpose of this investigation is 
to determine in what ways spontaneous colored chalk productions differentiate be- 
tween patients with psychoneurosis and those with cerebral pathology. 


METHOD 
The materials used were white paper, size 844” x 11”, an ordinary stop watch, 


five colored chalks, blue, green, red, yellow, and black.! The paper was placed on the 
table directly in front of the patient with its longer axis oriented from left to right. 
The colored chalks were then arranged on the table in the following order: blue, 
green, red, yellow and black, reading from left to right. The examiner carefully 
named each color as he placed the corresponding chalk on the table in order to make 
certain that the patient became thoroughly familiar with all of the elements with 
which he was being confronted. 

The following instructions were administered orally: ‘‘Using one or more of 
these colored chalks fill in the space on this paper completely. Make sure that you 
leave no white space. Remember you may do anything you like and use as many or 
as few of these colors as you wish as long as you leave no white space.” 

If the patient raised any questions they were answered in a manner consistent 
with the above instructions. 

On his record sheet the examiner noted the name of each color as it was used 
and the time consumed for each from first contact with the paper to the return of the 
chalk to the table. Occasionally patients selected and disposed of chalks so rapidly 
that it was virtually impossible to obtain an accurate time measure, and in those 
cases a recording of “less than 5’’ was made. Two such “‘less than 5” recordings were 
taken to be equivalent to 5 seconds, and all time recordings were rounded off to the 
nearest 5 second interval. 

In addition to these notations the examiner recorded various behavioral ob- 
servations such as blushing, nail biting, humming, wiggling about in the seat, as well 
as any verbal comments made. The latter often reflected the patient’s reactions 
toward specific colors yielding important clues concerning his expression of affect. 


*The author wishes to express his indebtedness to Miss Louise Hewson, Director, Psychology 
a Neurological Institute, without whose guidance this work could not have mM ac-. 
complished. 

1All of the surfaces of the chalks were exposed and therefore available for use. Also all of the 
colored chalks were approximately equal in size, 3’’ having been found suitable, some allowance being 
made for differences in sizes resulting from the greater use of some of the chalks compared to others. 
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Discrepancies between the patient’s announced plans and actual performance were 
recorded. Occasionally a patient requested other colors or permission to leave white 
areas undone. The examiner noted if there was an unusual degree of conflict mani- 
fested in selecting colors and observed how impulsive, deliberate, or hesitant the ap- 
proach was. Other observations concerned the obliteration of certain colors, the 
pressure with which the colors were applied, the direction of approach, rotation of 
paper, and the degree of literalness with which instructions were followed. Some pa- 
tients, for example, left considerable space undone, whereas others actually marked 
up the table top in an attempt to comply with the instructions. Further pertinent 
observations, similar to those the clinician makes when more conventional projective 
tests are administered, yielded significant clinical information. These incidental ob- 
servations, although important for personality’ evaluation, were not subjected to 
analysis for the purposes of this investigation. 


SUBJECTS 


The colored drawings examined for this study came from the productions of a 
group of adult patients seen at Neurological Institute, Columbia-Presbyterian Medi- 
cal Center who were referred to the Psychology Department usually for the purpose 
of a differential diagnosis. The sample consisted of 38 patients, 21 later diagnosed as 
suffering from functional disorders (psychoneurotic group) and 17 later diagnosed 
as having a cerebral involvement. The criterion used was the final discharge diag- 
nosis, whenever available, based upon a variety of laboratory techniques such as 
electroencephalogram findings, neurological examination, pneumoencephalogram 
results, skull X-rays, and psychological examination. Whenever discharge diagnoses 
were not available, as was the case especially with clinic patients, the criterion was 
the impression obtained from the psychological examination which included usually 
the Wechsler-Bellevue scale, Substitution Test, Bender-Gestalt Test, projective 
drawings, and Rorschach inkblot method. In the interpretation of the psychological 
findings the examiner relied heavily on the pattern of deviation ratios on the Wech- 
sler-Bellevue Scale and the Substitution Test as they are used by Hewson“? as aids 
in neuropsychiatric diagnosis. 

The psychoneurotic group ranged in age from 16 to 60 years, mean 37.5 years 
(S. D. 10.6). Their range of intellectual functioning extended from the low average 
classification, an IQ of 86, to the superior classification, an IQ of 124. The mean 
quotient was 107 (S. D. 11.2). The organic group had an age range of 15 to 58 years 
with a mean age of 43.4 years (S. D. 11.4). In intelligence they ranged from a quo- 
tient of 70 to 110, the mean quotient being 94.9 (S. D. 11.7). Both sexes were repre- 
sented but not in equal numbers since there were 63.2% females in the psychoneu- 
rotic group and 30.8% in the organic group. 


RESULTS 


An analysis of 6 variables was made in comparing the performance of the two 
groups on the colored drawing test. The variable which seems most obvious and 
which can be most easily examined is the content of the drawings. In comparing the 
content of the spontaneous drawings, it will be seen from Table 1 that 94.1% of the 
patients with cerebral pathology produced horizontal or vertical colored strips or 
single-colored, undifferentiated wholes. On the other hand, only 19.1% of the pat- 
ients with psychoneurosis made productions of this kind. The majority of psycho- 
neurotics produced blended designs of various colors, objects, scenes, diagonal or 
wavy designs, abstract designs, or people, here designated as content of the Type 1 
class.? 


*It is interesting that with two patients who were not included in the present sample because of 
an ——— discharge diagnosis; i.e., in both instances a strong possibility of a lesion without definite 
roof was indicated in addition to psychoneurosis, the patterns of the drawings were also equivocal. 

oth productions lacked the form of a definite design and the irregular colored areas, although not 
continuous strips, seemed to approach a striplike pattern. Both patients had been thought to have 
cerebral involvements on the basis of psychometrics alone. 
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TaBLe 1. A CoMPARISON OF THE CONTENT OF COLORED PropucTIONs oF A GrRouUP OF PATIENTS 


WITH PsYCHONEUROSIS AND WITH CEREBRAL ABNORMALITY 








Content 





Type 1* 


Scenes Diag- 
(%) onal 
or 
wavy 
de- 
signs 
(%) 


Type 2 


Undif- 

fer- 

enti- 

ated 
one 

color 

“I 
%) 


4.8 





Diagnostic 
group 


Blend- Ob- 

ed de- jects 

signs (%) 
(%) 


Ab- pig — 
a Pe 
0) Tn pe 


ens 
tent 


(%) 


Hori- 
zon- 
tal 

or ver- 
tical 


wCs) 


14.3 





Neurotics 
(N =21) 


4.8 14.3 4.8 81.9 


Organics 


(N =17) $9 — 5.9} 58.8 35.3 94.1 











*A considerable number of drawings of the psychoneurotic group could have been classified in 


more than one “Type 1” category, but in order to simplify the analysis each production was placed in 
only one such category. 


The differences between the two groups in average time consumed in drawing 
on the test as a whole as well as for each color used were computed. Although these 
trends are not statistically significant (p=.2 in each case), the neurotic patient tends 
to spend a greater amount of time on the task as a whole (average time 165.2 sec- 
onds) than does the patient with a cerebral involvement (average time 135.5 sec- 


onds). The organic patient, however, tends to consume a longer period of time for 
each color which he uses (mean time of 83.0 compared to 55.3 seconds). These find- 
ings seem to suggest that the neurotic patient generally selects a greater number of 
chalks of different colors than patients with a cerebral pathology. The Chi-square 
analysis in Table 2 illustrates that the two groups differ significantly in the total 
number of chalks of different colors used. 


TaBLE 2. A CoMPARISON OF THE TOTAL NUMBER OF DIFFERENT Coors UsEp By 21 NEUROTIC AND 
17 OrGanic PatTIENtTs 








Number of Colors 


Frequencies 





Neuroties (%) 


Organics (%) 





4.8 
19.0 
19.0 

4.8 














35.3 
5.9 
5.9 
11.8 
41.1 





9.2 05 











The fifth variable analyzed involved the kind of colors selected first. The re- 
sults presented in Table 3 indicate that patients with pathological cerebral condi- 
tions tend to use the color blue much more frequently, and the color black much less 


frequently than expected on a theoretical prediction of equal frequency of usage of 
all five colors. 
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TasLe 3. Tue FREQUENCY oF UsaGcE or Eacu or 5 Coors As A First AND AS A LAst CHOICE FOR A 
Group or 21 PaTrEnts WITH PsYCHONEUROSIS AND 17 PATIENTS WITH CEREBRAL PATHOLOGY 








Blue Green Red Yellow’ Black | X? p idf. 





First choice 
Psychoneurotics 
Observed frequency 


Expected frequency 





Patients with : 
cerebral abnormality 
Observed frequency 


Expected frequency 





Last choice 
Psychoneurotics 
Observed frequency 


Expected frequency 





Patients with ; 
cerebral abnormality 
Observed frequency 6 4 3 2 


Expected frequency 3.4 3.4 3.4 3.4 3.4 

















The neurotic patient also, as can be seen from Table 3, tends not to use all colors 
equally frequently as his first choice. The colors blue and black both tend to be more 
often selected first than expected on a theoretical basis of equal frequency. 

Not only may a patient’s first selection of a particular color be of special sig- 
nificance in his dealing with emotional material, but his last selection may also be of 
particular importance. Accordingly a similar analysis was done for the last choice 
and is summarized in Table 3. The distribution of frequencies is found not to be 
significantly different from chance expectations for either group and therefore this 
measure fails to differentiate between the two groups. 


DIscussION 


The most significant finding in this pilot study is that patients with emotional 
disturbance are able to organize colored materials, which are assumed to be related 
to an individual’s emotional drives, on a more complex level than the organic is able 
to do. The technique used, although similar in certain respects to the Rorschach 
test, differs in that the patient is required to make not only selective and associative 
responses of a perceptual type to colored stimuli, but he is also asked to actively 
organize and construct something by altering the environment. As Fortier®) sug- 
gested, in his discussion of the Mosaic Test and easel-and finger painting techniques, 
this involves the making of a plan and its execution through actual manipulation of 
objects. It is for this reason that the present technique is regarded as especially 
sensitive for diagnostic purposes. 

The patient with cerebral pathology appears to react either with emotional 
diffusion, as seen in his tendency to often use one colored chalk to fill in the entire 
page, or with excessive stereotype and “stimulus boundness’’, as reflected in his 
tendency to make simple colored strips. As Goldstein“ has indicated, the organic 
suffers from an impaired ability to abstract and is overly concrete. Thus parallel 
to the observed behavior in dealing with colored material goes the organic’s reduction 
of ego control over affect and his characteristic dependency upon immediate en- 
vironmental stimulation. The neurotic, on the other hand, although he may over- 
react to emotional situations and be driven by conflicting emotions, does not usually 
manifest the same fundamental inability to control affect. 
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SUMMARY 


Numerous devices and investigations of the validity of psychological techniques 
aimed at detection of pathological cerebral functioning abound in the literature 
(1, 3,5, 6). The present investigation was designed to determine in what ways patients 
with cerebral abnormality and those with psychoneurosis differed on a spontaneous 
color drawing test. It was assumed that under the impact of highly emotional 
material, characteristic impairment of visual-motor organization would be most 
evident. The six variables analyzed were (1) content of drawings; (2) time consumed 
in actual color usage; (3) average time for each color; (4) number of colors used; (5) 
colors selected first; and (6) colors selected last. The most significant finding refers to 
the high percentage of patients with cerebral pathology who produce either a series 
of simple colored strips or use one color in completing the entire surface of the page. 
The neurotic patients, on the other hand, tend to organize their work into higher 
level designs. There is very little overlap between groups. Because of the relatively 
small number of patients used, and the fact that the groups were not strictly com- 
parable because of the differences in age and intelligence, these preliminary results 
are merely regarded as trends to be cautiously interpreted and verified with larger 
populations. 
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A PRELIMINARY STUDY TO DEVELOP A MORE DISCRIMINATING 
F+ RATIO 


MARVIN J. FELDMAN, CAROLYN GURSSLIN, MARVIN L. KAPLAN AND NIDIA SHARLOCK 
University of Buffalo 


INTRODUCTION 


While Rorschach “? suggests in his original monograph that form level is related 
to ego control, the explicit interpretation of the F+ ratio as a measure of ego 
strength was made by Beck“. His interpretation received some experimental con- 
firmation from a study by Williams“? which demonstrated that F+ could be used 
to predict the reactions of normal subjects to a highly stressful situation. Most ex- 
perienced Rorschach workers, however, have probably noted that many neurotic 
subjects achieve F+ ratios in the optimal range between 80-90%. Certainly these 
patients would not be characterized as having strong egos; hence, the general appli- 
cability of the ratio would seem to be limited. The records of neurotic subjects with 
high F+ ratios usually contain a preponderance of easily perceived forms, i.e., pop- 
ular and vague concepts which still receive good form ratings. 

Estimates of ego strength based on F+% could be more valid if it were possible 
to separate those clinical patients with spuriously high ratios from the total group of 
subjects with optimal ratios. The present study undertakes to devise such a more 
discriminating F+ ratio. 
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Form LEVEt Ratios 


At present, a number of F+ ratios are used which vary slightly and probably 
intercorrelate highly. Thus Beck’s ratio considers only pure form responses and can 
be written: 
> pure F+ 


Beck F+% = 
a pure F+ 
It should be noted, however, that animal and inanimate movement responses receive 
a pure F score in Beck’s system. 
Hertz makes use of three F+ ratios called “R+”’, “F+ Rorschach”, and “F+ 
primary”. In this study only the R+ ratio will be used for comparative purposes. 


This ratio can be written: 
Hertz R49 = Silt responses with form 


all + responses with form 





Thus R-+ includes all responses in a given record excepting formless concepts such 
as C, K, ete. 

To better understand the construction of the new ratio, it should be noted that 
responses receiving ‘‘plus’” form ratings in Beck’s® or Hertz’s® tables can be furth- 
er defined into the following four classes: 


1. Originals: infrequent responses listed in Hertz’s tables supplemented by 
judgment of examiner. 


2. Usuals: responses given too often to be originals but not often enough to be 
populars. These responses make up the majority of responses in most normal 
records. 

3. Populars: very frequent responses as listed in Hertz’s tables. 

4. Vagues: responses which inherently have indefinite form are scored as 
vagues, e.g., clouds, smoke, indefinite maps, and x-rays. Some concepts which 
usually have definite form may be scored as vagues when they are not clearly 
delineated, e.g., a human figure seen indistinctly in the distance. Conversely 
some concepts which usually have indefinite form may be clearly articulated 
and would not be scored as vagues, e.g., a cloud which takes a human or ani- 
mal form. 


The numerator of Hertz’s R+ ratio includes all four types of “‘plus’” responses. The 
numerator of the proposed new ratio eliminates the latter two categories but uses the 
same denominator as in the R+ ratio. The new ratio would be written: 


Feldman’s F+% = all + form originals and usuals 
all + responses with form 





The writers believe that there will be fewer “false negatives” using this ratio than is 
true of older ratios, i.e., fewer neurotic subjects will obtain ratio scores within op- 
timal limits. It should be apparent that the new ratio is computed in such a way 
that it is quite possible to obtain a 100% Beck F+ or Hertz R+ score and still be 
close to zero on the new ratio. This lack of interdependence between the new and old 
ratios enhances the possibility that the new ratio will yield results different from the 
older ratios although it doesn’t guarantee that such will be the case. 

Because the present study is limited in scope, it cannot clearly demonstrate the 
superiority of the new ratio over the older ones. The aim of this study, rather, is to 
demonstrate that the new ratio yields different results with clinical subjects than do 
either of the other two ratios. This finding would seem to be a necessary although 
not sufficient condition if the new ratio is to be a more effective predictor of behavior 
reflecting ego strength than its predecessors. Thus two results anticipated with the 
new ratio can be stated as hypotheses: 
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1. The scores of the clinical group will be significantly lower than those of the 
normal group on the new F-+ ratio. Neither of the other two ratios however 
will differentiate between the scores of normal and neurotic subjects. 

2. The correlation between scores on the new and the older ratios will be sig- 
nificantly higher in normal than in neurotic subjects. 


Verification of the second hypothesis is necessary if it is to have an interpretation 
different from the older ratios, but verification of the first hypothesis actually offers 
some presumptive evidence for the greater efficacy of the new ratio. 


SUBJECTS 


The sample populations consisted of 53 normal and 30 psychoneurotic subjects. 
Although perhaps adequate for the purposes of this preliminary study, the normal 
group was far from an ideal sample consisting as it did of practice cases obtained by 
members of the senior author’s Rorschach class. To purify the normal group, cases 
were excluded who were or had been under psychiatric care and to make the group 
roughly comparable to the neurotic group, a lower age limit of 19 was set. The other 
group was composed of cases drawn from the files of a VA mental hygiene clinic and 
diagnosed as psychoneurotics.* 

The two groups were not matched as closely as might be desirable. Table 1 in- 
dicates that the two groups were quite different with respect to education and aver- 
age number of Rorschach responses. It cannot be decided upon an a priori basis 
whether or not a systematic bias distorts the results and in what direction such bias 
might operate as a consequence of the differences in education and number of re- 


sponses. However, the results of this study must be interpreted with caution until 
confirmed by other studies. 


TaBLE 1. A CoMPaRISON OF NORMAL AND Neurotic Groups oN AGB, EpucaTION, AND NUMBER OF 
RorscHacH REsPONSES 











Average Ave 
Average Age Education No. of 


Group x 8 z 8 x 








Normal 31.4 9.8 14.0 4.3 31.5 
Neurotic 36.4 13.6 10.2 2.4 24.5 


1.73 16.0 2.39 
08 -0001 01 





























PROCEDURE AND RESULTS 


The records of the two groups were scored for form level with the aid of the 
Hertz frequency tables“. All “plus” responses were further designated as either 
original, usual, popular, or vague. A simple check was made on the reliability of 
scoring. On a random basis, 100 responses were selected from the records of the 
normal subjects. Six of the ten test administrators then scored this standard series. 
There was absolute agreement on 78% of the scores, which is well above chance ex- 
pectation and affords some evidence for believing that the reliability of the scoring 
of individual records was adequate for the purposes of this study. 

The first analysis compared the average scores between normal and neurotic 
groups on the three form level ratios. Table II seems to bear out the first hypothesis 
since neither Beck’s nor Hertz’s ratios discriminate between the two groups while 


data from The Veterans Administration Regional 


1In his position as consultant in clinical peychology, the senior author obtained a portion of his 
ce, 


Buffalo, New York. 
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TaBLE 2. ComPaARISON BETWEEN NORMALS AND NEUROTICS OF THE 
AveRAGE Scores ON THREE F+ Ratios 








Ratio N t 
Hertz R ad 


Normal 


p(one tailed test) 





82.3 


Neurotic 85.1 
Beck F+ 
Normal 83.3 


Neurotic 85.1 | 1.14 


Feldman F+ 


Normal 56.8 


47.7 


10.8 


Neurotic 14.2 




















the new F+ ratio discriminates in the expected direction. The present data also 
furnish a start on the collection of normative data for the new ratio. The optimal 
range of scores on older ratios is generally considered to be between 80-90%. Scores 
on the new ratio, however, tend to be considerably lower than scores on older ratios; 
thus necessitating a somewhat different frame of reference for interpretations. Asa 
first approximation, it would seem that the optimal range for the new ratio might 
be between 50-65%. However, these limits should be considered as highly tentative 
because of the inadequacy of the present normal sample. The test of the second 
hypothesis was based upon the infercorrelations of the three ratios in each group. 
The results shown in Table 3 tend to bear out the second hypothesis. The new F+ 
ratio correlates higher with each of the older ratios in the normal than in the neurotic 
sample. “r’s” is not large enough, however, to be statis- 


The difference between “‘r’s 
TABLE 3. INTERCORRELATIONS OF THREE F+ Ratios In NoRMAL AND Neurotic Groups 








Groups 


Feldman F + 
vs. 
Hertz R+ 


Feldman F+ 
vs. 
Beck F+ 


Hertz R+ 
vs. 
Beck F + 





Normal (N =53) 
Neurotic (N =30) 


-61 
.25 
* 


.37 
17 


84 
.87 














*Difference between r’s significant at .05 level. 


tically significant when considering the new ratio and Beck’s F+, although the 
difference attains significance when comparing the correlations of the new ratio and 
Hertz’s R+. At any rate, the low magnitude of correlation between the new ratio 
and each of the older ratios in the neurotic group indicates that it is possible to ob- 
tain quite different results by using the new ratio. While the new ratio might yield 
results comparable to R+ in normal groups, it does not seem to have much in com- 
mon with Beck’s F+ ratio in the normal or in the neurotic group. The high correla- 
tions between Hertz’s R+ and Beck’s F+ in both normal and neurotic groups in- 
dicate that both ratios measure much the same thing. It seems likely that all F+ 
ratios now in use would also intercorrelate highly. Thus it can be concluded that F+ 
ratios in current use all lead to similar estimates of ego strength, but such estimates 
based upon the new ratio may differ markedly with those of its predecessors particu- 
larly for neurotic subjects. However, the evidence in this study is far from conclusive 
in demonstrating whether the new ratio yields more accurate predictions of behavior 
for neurotics than is true of the older ratios. 
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DIscussion 


Since the results obtained in the present study were in accordance with hy- 
potheses stated, the new F+ ratio seems to merit further study. A more decisive 
proof of the efficacy of the new ratio would be to compare its predictive power with 
that of older F+ ratios in experimental situations like the William’s study “ where 
differences in ego strength should lead to differential reactions. In such studies the 
new ratio might have greater predictive power for neurotic subjects, but not neces- 
sarily for normals. 

When we examine the nature of this new ratio more closely in terms of Ror- 
schach rationale, implicit assumptions seem to emerge about personality dynamics. 
Chiefly, we assume that stressful life situations produce more anxiety for people 
with rigid stereotyped defenses, i.e., individuals with flexible ego structures are not 
so dependent upon conventional stereotyped patterns of behavior to deal with stress 
in their lives. By utilizing only usuals and originals in the new F-+ ratio, it is be- 
lieved that the new measure will more closely reflect these functions of ego strength. 


SUMMARY 


The purpose of this study was to devise a new F+ ratio which might be a more 
valid measure of ego strength for certain neurotic groups than is true of F+ 
ratios in current use. 

The new ratio was based upon the percentage of usual and original responses of 
good form to the total number of responses excepting formless concepts such as 
C, K. ete. 

When the records of a group of normals and a group of neurotics were compared, 
there was a significant difference between the scores of the two groups on this 
new ratio but no difference on Hertz’s R+ or Beck’s F+ ratios. 


A correlation matrix was obtained from scores on the new F+, Hertz’s r+ and 
Beck’s F+ ratios. As predicted above, the correlations were lower for the neu- 
rotic sample than for the normal sample between the new F+ ratio and each of 
the other two ratios. Hertz’s and Beck’s ratios, however, intercorrelated very 
highly in both samples. 

The present findings indicate that the new ratio is sufficiently different from its 
predecessors to justify using it as a complement to the older ratios. 
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THE EFFECTS OF RECENT PERCEPTUAL TRAINING AND 
EXPERIENCE ON RORSCHACH PERFORMANCE! ? 


IRWIN J. KNOPF 
State University of Towa*® 


PROBLEM 


Many investigators would agree that the structural aspects of personality are 
in a sense permanent, and as a result are relatively resistant to change or modifica- 
tion”: "). If stability of personality structure is to be expected, then it should follow 
logically that any test which purports to measure personality structure should re- 
flect this stability in its results. Such leading exponents of the Rorschach method as 
Beck ®), Klopfer®, Hertz“, Halpern“, and Piotrowski®® have, in fact, made 
this claim. 

Numerous studies have demonstrated that under certain conditions Rorschach 
scores can be significantly modified. Conditions such as induced hypnotic states, 
drugs, faking, motor inhibition and hyperactivity, and a variety of attitudinal and 
instructional “sets’’ are among the many which have been employed. To the extent 
that some of these conditions were transitory and superficial, these studies cast con- 
siderable doubt on the validity of the assumption that the Rorschach reveals basic 
personality structure. The recent experiments which have demonstrated that var- 
ious Rorschach scoring variables can be significantly affected by means of varying 
the ‘‘set’”’ of the subject, however, have relied heavily on volitional factors. While 
these results indicate that a subject can wilfully mask or change the presumed pic- 
ture of his personality structure, conscious ‘sets’ of this sort seem to “stack the 
cards” against the validity of this Rorschach assumption in that the subject is told 
how and in what direction he is expected to change. 

This study was designed to determine the “unconscious transfer” effects of re- 
cent perceptual training and experience on certain aspects of the Rorschach per- 
formance, in an attempt to shed additional light on the validity of this Rorschach 
assumption, as well as to extend our knowledge of conditions which may affect the 
stability of Rorschach performance. Certain hypotheses were formulated in con- 
junction with the experimental conditions. 


1. The introduction of perceptual training would alter these features of 
the Rorschach performance: 
a. Increase the number of Dd responses (Dd%). 
b. Increase the number of pure form responses (F%). 
c. Increase the number of animal responses (A%). 


2. The introduction of perceptual experience with color would modify 
the Rorschach performance as follows: 
a. Increase or decrease the sum of the color responses. 
b. If such alteration was found there would not be a corresponding 
change in the M responses. 


PROCEDURE 


A total of 48 college students, 24 females and 24 males between the ages of 17 
and 24 were randomly selected from a list of volunteers in the Introductory Psy- 
chology courses at Northwestern University. All of the subjects were considered 


“naive” in that they had not been exposed to the Rorschach Test prior to this ex- 
periment. 


1This is a modified version of a dissertation submitted to the Graduate School of Northwestern 
University in partial fulfillment of the requirements for the degree of Doctor of Philosophy. The 
writer wishes to express his appreciation to Drs. T. W. Richards, C. P. Duncan, E. L. Clark, and J. 
W. Cotton for their valuable suggestions and counsel. 

*Presented at the annual Midwestern Psychological Association meeting in Chicago, 1953. 

‘From the Psychopathic Hospital and the Dept. of Psychiatry, College of Medicine. 
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The basic design of the study involved Rorschach test-retest, spaced three weeks 
apart, for all subjects. The standard procedure recommended by Beck? for the 
administration and scoring of the test was employed. Immediately preceding the 
second Rorschach testing the subjects were exposed to one of three conditions for a 
period of approximately 30 minutes. After the first Rorschach administration, the 
48 subjects were assigned and matched to one of three groups on the basis of their 
scores for R, Dd%, F%, A%, sum of C, and sum of M. There were eight females and 
eight males in each group, making a total of 3 groups of 16 subjects. The identity 
of the subjects was concealed from the Experimenter by code numbers assigned in- 
dependently during the matching process. Additional precautions were included to 
prevent ‘“‘conscious” transfer from the experimental conditions to the final Rorschach 
performance. The subjects were informed when initially contacted that they were to 
serve in two separate and different experiments. Moreover, the experimental and 
control conditions were administered by three assistant Experimenters*, as well as in 
a different building. Each assistant served an equal number of times for each of the 
three conditions, so that the individual effects of the assistant Experimenters were 
equally divided among the three conditions. 


DESCRIPTION OF THE CONDITIONS 


Experimental Condition I consisted of the presentation of a series of 14 2” x 2” 
slides, half of which were achromatic, and half of which were chromatic.’ A fifteenth 
slide, which was both black and white and colored, was used in conjunction with the 
instructions for practice purposes. In each instance there was an apparent content 
of structured and conventional objects. However, hidden in each of the slides as a 
small detail of the larger figure was one or more animals or animal parts. Each slide 
was exposed for ten seconds, with a ten-second interval between exposures. Four 
trials of 14 exposures per trial, together with the instructions and practice slide, 
comprised the entire session. The order of presentation of the slides was predeter- 
mined and fixed so that it was the same for each subject and the same for each trial. 
The subjects were required to determine the total number of hidden figures for each 
slide, to identify them, and to locate them. 


Experimental Condition II consisted of the individual presentation of a sound 
motion picture film entitled “The Nature of Color’. While the film involved a vivid 
color experience, its contents were objective and non-personal. Following the pre- 
sentation of the film, the subject was asked to write all that he could remember about 


the movie for the bogus purpose of evaluating the effectiveness of films for college 
teaching. 


Control Condition III consisted of a list of 14 nonsense syllables presented in a 
serial order on a memory drum, with the first syllable used as the start signal. Stand- 
ard two-second exposures and procedures were followed. A total of forty trials com- 
prised the entire session. 


RESULTS 


Table 1 gives the mean scores obtained from the first and the second Rorschach 
administrations for the experimental and control groups on each of the six Ror- 
schach scoring variables. From this it will be noted that the mean values on the 
initial Rorschach performance were approximately equal for all groups with respect 
to these scores. 

In order to control for any inter-subject variation on the initial test scores 
which may not have been eliminated in the matching process, analysis of co-var- 
iance was applied to the data for each scoring variable. Each analysis included the 
variance contributed by each of the three groups. A summary of the F values ob- 


‘The author is indebted to Misses Katherine Jones, Barbara Breslower, and to Dr. Victor E. 
ao. for serving as assistant experimenters. 
he writer is most grateful to his wife, Roberta, for her work in constructing these slides. 
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TaBie 1. Rorscuacn Test-Retrest MEAN ScorEs For Eacu Group or 16 Sussects 








Rorschach Group I Group IT Group III 
(Controls) 
Test Retest 


Variable (Perceptual) ee 


Test Retest Test test 





R 24.75 32.25 26.56 26 .37 24.87 26.80 
58.14 63.31 57.69 58.62 57.69 56.76 
46.47 49.80 45.36 44.20 47.35 44.68 
6.18 12.42 5.48 7.95 6.55 8.79 
3.37 3.94 3.34 2.97 3.34 3.47 
1.87 2.00 3.06 3.50 2.75 2.69 














Taste 2. Summary or F VALUES AND PROBABILITIES FROM ANALYSIS OF 
Co-VARIANCE OF THE First aND Seconp RorscHacuH TEsTs FOR ALL 
Groups (48 Ss) 








Rorschach Variable F Value* Probability 
R 2.99 .060 
F% 1.20 
A% 2.63 
Dd% 2.49 
Cc .83 
M .75 














*With 2 and 44 degrees of freedom the F values for the .05 and .01 levels of 
confidence are 3.21 and 5.12 respectively. 


tained and their probabilities are shown in Table 2. By means of a technique intro- 
duced by Burke“, it was possible to estimate the actual probability values of F. 

The results indicate that the three groups were not significantly different on 
their second Rorschach performance with respect to the sum of C and sum of M 
scores. The F test obtained between the means of the three groups for the F% score 
indicates that the groups were not significantly different with respect to this variable. 
The F values of the R, A%, and Dd% scores, however, were statistically reliable 
at the .060, .062, and .087 levels of confidence respectively. Inspection of the mean 
difference suggests that these F values were most influenced by and attributed to the 
mean increases from the first to the second tests associated with Experimental Condi- 
tion I. While these findings do not meet the customary criteria for statistical sig- 
nificance, they may reflect important trends in terms of the effect of perceptual 
training on the Rorschach performance. 

In an effort to elucidate these marginal findings, this Experimental condition 
was repeated with 11 additional subjects, five females and six males. The subjects 
were obtained in the same manner as in the first experiment, and wherever possible 
the procedure and experimental design were identical. — 

Mean scores for the first and second Rorschach testing and the mean differences 
between the two test performances are listed in Table 3. The results obtained from 
separate analyses of co-variance indicate no statistically reliable F value for any of 
the scores in question, namely: R, F%, A%, or Dd%. However, in each instance 
the mean difference was in the same direction as previously found and also was nearly 
of the same magnitude. 
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TaBLE 3. Rorscuacu Test-REetTEesT MEAN Scores AND MEAN DIFFERENCES FOR THE REPLICATED 
Group or 11 Supsects 








Rorschach Mean 
Variable Mean Scores Differences 


Ist Test 2nd Test 
23.73 26.00 2.27 
60.42 69.31 8.89 
48 .66 55.91 7.25 

6.80 11.87 5.07 




















Discussion 


The results obtained from Experimental Condition II (the film on color) in- 
dicated clearly that this experience failed to alter appreciably the sum of C or the 
sum of M scores on the Rorschach performance. Insofar as this condition is concern- 
ed, these findings would seem to lend support to those who believe that the “core” 
of the personality as indicated by the Rorschach method changes very little. On the 
other hand, the results obtained from Experimental Condition I (perceptual training) 
and its replication consistently point to an increase in the R, A%, and Dd% scores 
as a consequence of this training experience. It is apparent, however, that the in- 
creases in scores were small, and in order to demonstrate this effect at a statistically 
reliable level it would be necessary to increase the size of the sample considerably to 
reduce the error term in order to obtain large enough F values. At best, therefore, it 
could be said that while this experience has increased these Rorschach scores, the 
effect is small and perhaps negligible. 

The results of other studies indicate clearly that “‘set’’ is an important variable 
affecting Rorschach performance. Under conditions of specific instructions to change, 
Hutt et al), and Gibby “ found that the F+, D+, Dd, W, R, range of content, and 
A% scores were significantly altered, while no concomitant modification was observ- 
ed in the M:C ratio and the M and C scores. However, when the subjects were in- 
structed to perceive more M, this score significantly increased. The findings of the 
present study may reflect the effects of two other points on the continuum of “‘set’’. 
The perceptual training experience involved a less direct ‘‘set’’ in which the subjects 
were given an orientation to perceptual material. While this condition did seem to 
increase some scores slightly, mere exposure to a film on color did not affect the 
stability of the Rorschach scores studied. Thus, it would appear that there is a re- 
lationship between the strength of the “‘set’’ and the modification of Rorschach 
performance. 

Nevertheless, at least two alternative explanations remain: (1) certain aspects 
of the Rorschach performance can be altered more readily than others, and (2) inter- 
action between ‘‘set” and the stability of some Rorschach scores may be operating 
here. While the present data do not provide an unqualified choice of one of these 
alternatives, they do suggest that similar explorations of Rorschach scoring variables 
under a wide variety of conditions would be helpful in reducing sources of errors, 
and increasing the validity and predictability of Rorschach interpretations. 


SUMMARY 


The effects of two experimental conditions on six Rorschach scoring variables 
were studied. One group of 16 subjects was trained to perceive small animals or 
animal parts hidden in the apparent content in each of a series of 14 slides. Another 
16 subjects were exposed to a non-personal film on color. A control group of 16 subjects 
learned a serial list of nonsense syllables. All subjects were given the Rorschach 
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twice, spaced three weeks apart, with the experimental conditions immediately pre- 
ceding the second Rorschach testing. The results indicated that the color experience 
had no appreciable effect on the sum of C or sum of M scores, whereas the R, A%, 
and Dd% scores under the perceptual training condition very closely approached 
statistical significance at the .060, .062, and .087 levels of confidence respectively. 
The F% score for the latter condition was not significantly affected. The perceptual 
training condition was repeated with 11 additional subjects. While the resulting 
mean differences were in the same direction as previously found, separate F tests 
yielded values smaller than those required for the customary levels of significance. 
These results point to the slight and perhaps negligible effect of this perceptual ex- 
perience on the Rorschach performance. 
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CONTRIBUTIONS TO A THEORY FOR SELECTING PSYCHOTHERAPY 
PATIENTS 


FRANK AULD, JR. AND JEROME K. MYERS! 
Yale University 


INTRODUCTION 


Selecting suitable patients for psychotherapy is an important task, in which the 
therapist welcomes help from any quarter“?: . , In his decision to accept or reject 
a patient he is quite often influenced by the results of psychological tests. There is 
no general agreement, however, on what test results ought to persuade a therapist 
to accept a patient, and what results ought to induce him to reject a patient. Ob- 
viously, then, research on the use of psychological tests in selection of patients is 
much needed. 

Research on the validity of tests as guides to selection of patients can follow 
either of two broad lines. The first approach may be called empirical. The investi- 
gator who takes this approach chooses some criterion of suitability of the patient 
(e. g. the therapist’s judgment of the success of treatment or the number of inter- 
views the patient stayed in therapy) and finds out what tests or test items are cor- 


1Dr. Leslie Schaffer, formerly of the Yale Deportes of Psychiatry and now serving in the 


Army, collaborated with Myers in the collection of the data presented in this paper. Undoubtedly 
his ideas, as well as the facts he helped to gather, have shaped our pgronsrs hem the matters discussed _ 
in this paper. We are glad to acknowledge Dr. Schaffer’s important contribution. The investigation 

was partly supported by a research grant from the U. 8. Public Health Service. 
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related with this criterion measure. Any test or test item that is correlated with the 
criterion is thereafter used as a predictor, i.e., is used to select patients. The investi- 
gator is usually not interested in how or why the test predicts; he is satisfied if it 
seems to predict. 

The other approach might be called rational, because the investigator taking 
this approach attempts to understand the how and. why, then tries to make pre- 
dictions. He decides first what traits make a patient suitable for psychotherapy (in 
deciding this, he is probably guided by clinical experience, since no systematic studies 
exist), then he tries to measure these traits with reliable and valid instruments. The 
investigator taking this approach reaches the stage of application later, since he has 
to know what his tests are measuring before he can use them. But (we shall argue) 
his results are more likely to stand the test of time. 

Examples of the empirical approach are not hard to find. Kotkov and Mead- 
ow) and Rogers, Knauss, and Hammond “*) did empirically-minded studies, using 
as the criterion length of therapy; and Harris and Christiansen“? and Rogers and 
Hammond“? did similar studies, with success of therapy as the criterion. No ex- 
ample of a study in which the rational approach was used is known to the authors. 


DIFFICULTIES OF THE EMPIRICAL APPROACH 


The investigator who takes the empirical approach is beset by many difficulties. 
If he tries out a large number of test items, by chance he will discover some of them 
to be correlated with his criterion. This makes cross-validation studies necessary. 
Furthermore, the investigator cannot be sure that the correlations between test 
items and criterion hold generally true for all prospective patients, or whether the 
correlations he discovered are only true for the particular type of patient he happened 
to study. This makes validity-generalization studies necessary“). But even more 
important, the investigator doesn’t understand very well avy test items are cor- 
related with his criterion. Of course, he can try to explain the correlations after he 
gets them. Many ingenious ad hoc explanations have been invented! But often the 
investigator gives the wrong reasons instead of the right ones. 

Not knowing why he could make the predictions is a serious handicap to the 
investigator. Because of his ignorance, he can’t tell in which samples his prediction 
equations will work and in which ones they will not. Furthermore, he is stuck with 
the test items he first tried out, even if these happen to be laborious and awkward 
measures of the trait that makes prediction possible. For instance, if it should turn 
out that the total number of responses to the Rorschach predicts continuance in 
psychotherapy only because this item is a measure of verbal intelligence, it is obvious 
that measuring intelligence in this way would be inefficient. The verbal scale of the 
Wechsler-Bellevue would be easier to administer and would give better predic- 
tions ®?, 


First Sters Towarp A THEORY FOR PREDICTING LENGTH OF PSYCHOTHERAPY 


Since we believe that the rational approach has many advantages over a purely 
empirical one, we shall here try to make a start on constructing a theory for predict- 
ing length of psychotherapy. In this paper we wish to discuss only one characteristic 
of the patient, his social position. We realize, of course, that a comprehensive theory 
for selecting patients will have to embrace a large number of aspects of the patient’s 
background, personality, and current life circumstances. 

_ Dollard found that some middle-class patients come to therapy almost ready 
to jump on the couch and begin free association, without need for a word of in- 
struction!’ Upper-middle-class patients generally have heard of psychoanalysis and 
psychotherapy; they know what a psychiatrist does, that he helps patients by talk- 
ing with them. Lower-class patients are apparently less well-informed. Their ex- 
periences with doctors are with physicians who have treated physical complaints. 


*Personal communication from Dr. John Dollard. 
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As Freud said ‘7: »?. 5-46) belief in the efficacy of psychotherapy is probably of little 
importance; nevertheless, misinformation or wrong expectations could be an ob- 
stacle to the therapeutic work. Colby “ has noticed that the middle-class patient is 
more likely to be ‘‘psychologically minded.” He feels that this greater receptivity 
to therapy is a definite advantage. 

The social position of the patient not only influences what he knows and what 
he expects, but also seems to determine to some extent what rewards he will get out 
of therapy.* Patients are rewarded in many ways by psychotherapy: in the ideal 
case, they are rewarded when they learn and try out new, more adaptive habits; 
they are also rewarded by the therapist’s attention, by his understanding, and by 
his empathic interest in them. To get the rewards of more adaptive habits, life condi- 
tions must offer rewards for adaptive behavior. Perhaps in some cases the lower- 
class patient’s life has little to offer to reinforce a change in behavior; or, even if 
changed behavior could be rewarded, the middle-class therapist may be unfamiliar 
with the conditions of lower-class life and therefore unable to act as a dependable 
guide to the patient. To get the rewards of the therapist’s understanding and inter- 
est, the patient must make himself understood. He finds it harder to do this if he is a 
lower-class patient. The therapist doesn’t understand his language or the conditions 
of his life. Finally, to be rewarded by the therapist the patient must be able to under- 
stand what the therapist is saying. This is less often possible when the patient is 
lower-class and the therapist middle-class. Moreover, when we turn to consider the 
punishments in the situation we see that the fee (even if it is a small fee) is more 
punishing to the lower-class patient than to the middle-class one. 

Not only does the lower-class patient get less reward and more punishment from 
psychotherapy, but also his therapist gets less reward from treating him. The middle- 
class therapist is unfamiliar with the conditions of life of the lower-class patient. 
Therefore he finds it harder to be sympathetic and genuinely interested. He finds 
it harder to have empathic reactions to what the patient tells him. He confronts a 


more difficult task in instructing the lower-class patient, who, because of his different 
class-ways, may seem stupid. 

For all these reasons, therefore, we should expect that lower-class patients would 
do less well in psychotherapy than middle-class patients. We should expect that their 


cases would be judged as less successful and that they would quit or be dismissed 
more quickly. 


An EmptricaL Stupy oF THE RELATION oF SoctaAL Position To LENGTH OF 
TREATMENT 


With this background of theory, we proceeded to find out whether our expecta- ~ 
tions would be borne out by data. Schaffer and Myers“: '*) made a comprehensive 
investigation of the workings of an outpatient psychiatric clinic. Their investigation 
— us with data concerning the relation of social position to length of psycho- 
therapy. 

Schaffer and Myers measured patients’ social position by Hollingshead’s Index 
of Social Position “** %), This Index is computed from scores on three variables: 
occupation, education, and ecological area of residence. The Index scores permit the 
assignment of each patient to one of five social levels. Class I comprises families of 
wealth, education, and top social prestige. Class II consists of families in which the 
adults for the most part hold college degrees and in which the fathers have profess- 
ional or high-level managerial occupations. Class III includes proprietors, white- 
collar workers, and skilled workers; they are mostly high-school graduates. Class 
IV consists largely of semiskilled workers and laborers, with less than high-school 
education. Class V includes unskilled and semiskilled workers, who have a grade- 
school education or less, and who live in the poorest areas of the community. 


Our analysis of the interaction between patient and therapist parallels the analysis of teacher- 
pupil interaction so well done by Davis and Dollard (6. pp- 279-290), 
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The dependence of the number of interviews on social position is shown in 
Table 1. The mean number of interviews for the Class II and Class III patients was 
24.7; the mean number for the Class IV and Class V patients was 11.0. To get a 
measure of the degree of the relationship between length of treatment and social 
position, we computed a biserial correlation. The biserial r between class (the dicho- 
tomized variable) and number of interviews is +.403, which is significant at the 
.01 level. 


Discussion 


The results of the study of the outpatient clinic are consistent with the hypo- 
thesis that social position of the patient influences his response to psychotherapy. 
These results strengthen our belief that in the interactive process between patient 
and therapist, there are more rewards for both patient and therapist when the patient 
belongs to the middle class. Another factor, however, probably also plays a part. 
Schaffer and Myers“: © found that the Class II patients were likely to be assigned 
to the senior staff, the Class III and IV patients to residents, the Class V patients 
to medical students (patients assigned to medical students were not included in 
Table 1, however, because students worked in the clinic for clerkship periods of only 


TaBLe 1. Lenorn oF TREATMENT AND Socrat Ciass 








Per cent of patients 
Number of 
interviews Classes II and III Classes IV and V 
1to7 25.0 44.0 


8 to 19 32.5 36.0 








20 and over 42.5 20.0 











Note.—All patients reported on in this table were treated by residents or senior 
staff. There are 40 patients in Classes IT and ITI and 25 in Classes IV and V. 


four weeks and this arrangement set an artificial limit on the length of therapy of 
their patients). Because of this, it can be seen that the middle-class patient is likely 
to get a better-trained therapist. It seems reasonable to suppose that those patients 
who get more experienced, more skillful therapists will stay in therapy longer.‘ 

Since length of therapy is related to the social position of the patient, one could 
predict from the patient’s social position how long he will stay in therapy. A word 
of caution is in order, however; we are not sure that the relationship between social 
position and length of treatment would hold in other clinics. This must be deter- 
mined by further studies. 

Any measure of social position might give some prediction of how long patients 
will stay in psychotherapy. For example, some Rorschach scores may measure social 
position in a rough way, as Auld“: *) found. A study by Beck et al“ produced similar 
findings; executives, skilled workers, and semiskilled workers gave an average of 35 
responses to the Rorschach, but unskilled workers had an average of only 25. The 
correlation of test scores with social pusition would make the scores rough measures 
of social position; and this might permit one to predict length of therapy by means 
of the test scores. However, if social position of the patient is the only variable that 
makes prediction of length of therapy possible by these scores, we believe it would 
be wiser to use a precise measure of social position (like Hollingshead’s Index) rather 
than a rough and awkward measure. 


‘Our explanation here is similar to one given by Miles et al.“° for a relationship between IQ 
and success of therapy. 
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SUMMARY AND CONCLUSIONS 


Research on the validity of tests for selecting psychotherapy patients can follow 
two approaches: empirical and rational. The purely empirical approach is beset by 
many difficulties. It is argued that the rational approach is likely to be more fruitful. 

In this paper we have tried to make a start on constructing a theory for pre- 
dicting length of psychotherapy. We showed why it should be expected that middle- 
class patients should stay in therapy longer. The experience of one psychiatric out- 
patient clinic was examined. It was found that in this clinic middle-class patients 
stayed in therapy longer. This finding was interpreted as being consistent with the 
hypothesis concerning effect of social position on the patient’s response to psycho- 
therapy. 

We do not believe that our findings justify selecting middle-class patients and 
rejecting lower-class patients. (As a matter of fact, Schaffer and Myers found that 
the clinic unwittingly was doing just that.) Instead, we feel that these facts under- 
line the need for therapists and administrators to understand better the social back- 
ground of their patients. Perhaps, too, therapists should consider changes in tech- 


nique to meet the needs of lower-class patients, as Freud suggested many years ago 
(7, Pp. 401-402 ) 
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A STUDY OF TWELVE-YEAR-OLD RECIDIVISTS 
WILLIAM W. WATTENBERG AND FRANK QUIROZ 
Wayne University 


PROBLEM 

The present study is a follow-up to an investigation “ of factors associated with 
recidivism among eleven-year-old boys in trouble with the police in Detroit. In that 
study it had been found that young boys with more than one police contact tended 
to have poor school records, low mental ability, membership in unruly gangs, and a 
reputation for mischief. The connection between repeating and the usually reported 
variables of poor family conditions and low socio-economic status was present but 
was not strong enough to produce tabulations with statistical reliability sufficient 
to warrant rejection of the null hypothesis at the two per cent level of confidence. 

It was decided to repeat the study among twelve-year-olds because evidence in 
the literature indicated that reported delinquency showed an upsurge in frequency 
during that year of life. This was especially evident in research reports based on 
British experience. In describing a study made in 1948 of young people in Birming- 
ham (England), Reed “ stated that the peak age for juveniles referred to the courts 
was twelve. In an earlier study with a larger group, Carr-Saunders and his co- 
workers? had found a peak at thirteen. 

In American experience, the modal age comes later if the total of boys in trouble 
with the police or courts in any given year of life is taken as the index. It was fifteen 
or sixteen according to Banay’s analysis“? of U. 8. Children’s Bureau statistics. The 
Manns“) found that fifteen-year-olds furnished the largest proportion of delinquents. 
However, if age at first court appearance is taken as the index, the early upsurge be- 
comes evident. The Gluecks®? reported that various measures of central tendency 
fell between the twelfth and thirteenth birthdays. 

This situation appears to be the culmination of a long process rather than the 
result of any sudden change in juvenile behavior patterns. Writing from the vantage 
point of work in a school system, Kvaraceus“? noted that in most instances young- 
sters were referred to the courts only after the police had given them several oppor- 
tunities to ‘‘mend their ways.” Presumably, the police encounter quite a few young 
boys who are reported for various offenses but who merit lenient treatment. If re- 
leased to their parents with a warning, such boys do not get in trouble again. By 
contrast, the group which finally appears in court do not respond to this type of 
handling. Irritated or discouraged by the persistence of misconduct and neighbor- 
hood complaints in their cases, a police officer finally decides they are beyond his re- 
sources and invokes formal court procedures. 

It is important to recognize that although in the cases of boys declared de- 
linquent by courts there is almost always a history of early misbehavior, this does 
not imply that all boys who get in trouble at a tender age go on to become seriously 
delinquent. Detroit Police Youth Bureau records reveal that more than three- 
quarters of the boys ten or eleven years old at the time of their first police contact 
have had no complaints registered against them in a subsequent year. This poses a 
very practical question: Which items of information, alone or in combination, dis- 
criminate between those youngsters whose early misconduct is predictive of serious 
trouble and those who can wisely be released to the resources of their families and 
neighborhoods? Since few communities have adequate facilities to deal with all 
children who need special handling, it is important to screen out those boys whose 
development would lead to a happy outcome. Then, it would be possible to use the 
page facilities for those youngsters in whose lives special treatment might be 
crucial. 

For purposes of prognosis, among the eleven-year-olds previously studied ®?, 
factors linked to school adjustment and to gang activities had proved of greater sig- 
nificance than those involving family situations and neighborhood conditions. The 
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present study was undertaken to find out whether or not this pattern was equally 
true for twelve-year-olds. It was realized, of course, that the mere addition of one 
year of chronological age would hardly produce any drastic reversals. Continued 
school maladjustment and membership in unruly gangs would, if anything, become 
more potent influences as a boy aged. However, in view of the increase in delin- 
quency at this age level, it was suspected that the twelve-year-olds with repeated 
police contacts would be more similar to older delinquents in the sense that family 
and neighborhood would rise in significance for them. At any rate, this was the hypo- 
thesis we set out to test. 


PROCEDURE 


With assistance from the Youth Bureau of the Detroit Police Department* the 
police records of all boys “interviewed on complaint” in 1950 were obtained. Those 
who were twelve years old at the time of contact, some 460 in all, were selected for 
study. The records indicated whether the boy had been involved in previous com- 
plaints. On this basis, the group was divided into 130 repeaters and 330 non-re- 

eaters. 

i In connection with each contact, special Youth Bureau officers had made an 
investigation which included interviews with the boy, visits to his home, and ex- 
change of information with the school and with other social agencies known to be 
interested. The information thus secured was recorded on a “history sheet,” coded, 
and punched on IBM ecards. For 1950, these contained fifty items relating to the 
boy, his school, his family, and his neighborhood. For each such item, a table was 
prepared comparing the repeaters and the non-repeaters. All tables were submitted 
to the chi-square test to determine the statistical significance with which the null 
hypothesis could be rejected. 


FINDINGS 


Of the fifty tables, twelve showed relationships significant at the one per cent 
level; four, at the two per cent level. These are listed in Table 1. The number of 
statistically significant items was sixteen times chance expectation. As in the study 
of eleven-year-olds, school adjustment and a very rough indication of mental ability 
were related to repeating. At the one per cent level of confidence, more of the re- 
peaters: (1) were rated of ‘below average mentality,” (2) expressed unfavorable atti- 
tudes toward school, (3) were receiving poor marks, and (4) were out of their regular 
grades. 

The relationship between gang activity and repeating likewise remained sig- 
nificant. At the two per cent level of confidence, more of the repeaters belonged to 
definitely organized gangs. More interesting was the officers’ classification of the 
group’s activities. On the basis of their knowledge, the policemen considered that 
the majority of the non-repeaters belonged to orderly groups. By contrast, most 
repeaters belonged to gangs regarded as ‘‘rambunctious;” a significant minority 
went in for organized theft. 

As predicted by our hypothesis, family factors were also significantly related to 
repeating. At the one per cent level of confidence, more repeaters expressed un- 
favorable attitudes toward their homes and fewer of their parents took a forgiving 
or understanding attitude toward them. At the two per cent level of confidence, 
more of the repeaters came from homes in which the parents seldom or never parti- 
cipated in the boy’s activities. The officers found that almost all parents of non- 
repeaters were making an effort to understand or back up their sons. 

Before we can accept at face value the finding that family relationships were 
statistically significant for the twelve-year-olds but not for the eleven-year-olds we 
must take into account the fact that the population of twelve-year-olds in trouble 


*The authors wish to express their gratitude to Senior Inspector Sanford Shoults, Inspector Ralph 
Baker, and Lieutenant Francis ate for their aid not only in making records available but for giving 


generously of their time in securing the basic tabluations upon which this research is based. 
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TaBLe 1. Items LINKED witH REPEATING 








Subject of Tabulation 


Item Positively Associated 
with Repeating 


Level of Con- 
fidence (P) 





Grade in school 


Retarded or in special class 


01 





Attitude toward school 


Unfavorable 


01 





School marks 


Poor 


-O1 





Rating of intelligence 


Below average 


01 





Gang membership 


Member 


02 





Gang activity 


Disorderly; organized theft 


.O1 





Attitude toward home 


Unfavorable 


01 





Attitude of parents 


Punitive, rejecting 





Parents’ participation in 
boys’ activities 


Seldom; never 





Condition of dwelling unit 


Substandard; needs repairs 





Rating of neighborhood 


Slum 





Family owns a car 


Yes 





Appearance of boy 


Slovenly 





“Type”’ of boy 


“Peck’s Bad Boy” 





Religion Non-Jewish 


Attitude toward officer 








Evasive 








with the police was larger. Our studies included 460 twelve-year-olds but only 334 
eleven-year-olds. The chi-square test is peculiarly susceptible to the size of the 
group being studied. Therefore, it was necessary to go behind the simple test of 
significance to make sure that the apparent increase was not an artifact of the size 
of population. ‘This was done by comparing the percentages for the categories in each 
subgroup. On this basis, no important shift was found for the tables having to do 
with the parents’ participation in the boys’ activities. Surviving this second test 
was the increase in significance for the boys’ open expression of dislike for their 
homes. Among the eleven-year-olds, 4.0 per cent of the repeaters and 3.4 per cent 
of the non-repeaters had voiced such attitudes. Among the twelve-year-olds, 6.1 per 
cent of the repeaters but only 1.8 per cent of the non-repeaters expressed hostile 
attitudes. 

On the matter of socio-economic indices, the results were more clearcut. Three 
such variables showed a statistically significant relationship to repeating. At the 
two per cent level of confidence, repeaters tended to come more often from dwelling 
units rated as needing repairs or being substandard. At the one per cent level of con- 
fidence, they tended to come from neighborhoods rated by police officers as “slums” 
and from families not possessing automobiles. 

Here, again, it was necessary to go behind the simple calculation of significance 
to determine whether or not the apparent rise to reliability of these factors was an 
artifact of the increase in size of the population concerned. Comparing percentages, 
it appeared that the condition of housing presented a dubious case. However, on the 
other two items a genuine shift appeared to have taken place. The rating of “slum” 
had been given to the neighborhoods of 13.1 per cent of the eleven-year-old repeaters 
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and 14.3 per cent of the non-repeaters; for the twelve-year-olds, to 24.6 per cent of 
the repeaters as against 14.2 per cent of the non-repeaters. Families not possessing 
cars among the eleven-year-olds involved 59.6 per cent of the repeaters and 48.9 
per cent of the non-repeaters as compared with 67.8 per cent of the twelve-year-old 
repeaters and 51.0 per cent of the non-repeaters. 

Four other tabulations, not previously reported, also rose to statistical signi- 
ficance. At the one per cent level of confidence, the twelve-year-old repeaters were 
found to have been rated more frequently by the Youth Bureau officers as (1) sloven- 
ly in appearance, (2) being like ‘‘Peck’s Bad Boy,” and (3) “evasive” in their be- 
havior toward the interviewing officer. At the two per cent level of confidence, 
Jewish boys were absent from the ranks of the repeaters. A comparison of per- 
centages showed that, with the exception of the slovenly appearance, these were not 
new relationships but owed their newly acquired statistical significance to the in- 
crease in size of population. 


Discussion 


The findings tended to uphold the original hypothesis. School adjustment and 
gang activities retained the statistically significant relationship to repeating which 
had prevailed among the eleven-year-olds. In line with the prediction, one index of 
poor family relationship and two items reflecting low socio-economic status joined 
the list of significant items in a manner which reflected an alteration in relationships 
rather than a spurious statistical effect. In addition, the new statistical significance 
of ratings of personal appearance gave additional credence to the view that the 
twelve-year-old repeaters did conform more closely than had the eleven-year-olds to 
popular stereotypes of ‘‘the juvenile delinquent.” 

There is no surprise, and little point to discussion, in the fact that school ad- 
justment and the nature of the boys’ gang activities should be related to repeating. 
In almost all studies, school retardation has been found more prevalent among de- 
linquents of all ages. In an earlier study Wattenberg? had found that juvenile 
arrest statistics showed an unseasonal decline in the first month of summer vacation 
periods. There seems little question but that school maladjustment is at least a pre- 
cipitating factor in the delinquency of some youngsters. Similarly, boys who belong 
to gangs which engage in anti-social conduct are prone to continue getting in trouble 
along with their companions. 

Turning to the stronger effect of bad parent-child relationships among twelve- 
year-olds, it should be pointed out that the new element in the picture seems to be a 
more openly hostile attitude on the part of the boy. There is some question as to 
whether this is a cause or an effect of repeated police contacts. It may well be that 
the pressure within the family resulting from repeated official action may embitter 
the youngster. Where the home situation was poor, however, the boy might turn 
with greater intensity to get satisfactions with his gang. This could well lead to his 
making a more open emotional break with his parents. It would also serve as a 
practical guarantee that he would continue getting in trouble. Thus, although 
parent-child relationships might have relatively weak influence in preventing initial 
experimentation with adult-challenging behavior they could be critical in determin- 
ing its continuation over a period of even one year. We should expect to find among 
twelve-year-old repeaters a larger residual group of youngsters expressing a dislike 
of their homes. 

This may provide partial explanations as to why the police finally refer the boys 
to courts and why bad family situations are so common among youngsters finally 
adjudged delinquent by court action. Boys reaching the courts are not a cross- 
section of all offenders. Rather, they are those left over when the less forceful ele- 
ments of family “education” have patently failed and police have given up hope. 

The rise in potency of socio-economic factors can be explained in much less 
complex a manner. In their initial efforts at independence, youngsters in almost any 
neighborhood would do some things which could bring complaints to the police. In 


\ 
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the areas with high delinquency rates, they have more models to copy. This could 
also explain why they seemed to conform to stereotyped conceptions of “tough 
kids.”” Their sloppy appearance could well reflect efforts to ape an older group of 
neighborhood “heroes.” By twelve years of age, apparently, their induction would 
be in full swing. Recognizing this, the police could be expected to start filing official 
petitions against them. 

To test further the possibilities set forth in this section, a series of longitudinal 
studies is required. 


SUMMARY 


A group of 460 twelve-year-old boys contacted in 1950 by Detroit police was 
studied to find out what factors distinguished recidivists from non-recidivists. As 
has been found in a previous study of eleven-year-olds, school maladjustment and 
the nature of gang activities were significantly related to repeating. Unlike the 
younger group, the twelve-year-old repeaters came from a statistically significant 
larger proportion of homes with poor parent-child relationships and low socio- 
economic standing. They also bore greater similarity to stereotypes of “‘the juvenile 
delinquent” in appearance. 
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THE USE OF DRAWINGS OF THE HUMAN FIGURE AS AN ADJUNCT IN 
PSYCHOTHERAPY 


FRANCIS W. KING 
Dartmouth College 


INTRODUCTION 

Despite the growing literature on the evaluative and predictive roles of pro- 
jective techniques in relation to various kinds of treatment, few references exist on 
the direct use of projective techniques within the therapeutic situation.! The thera- 
peutic use of such techniques is an almost unexplored area“. It is primarily the 
Thematic Apperception Test, and its variants, that has been employed directly by 
clinicians in the therapeutic process“). The major purpose of this brief report is to 
illustrate by a single case the usefulness of employing human figure drawings as an 
adjunct in the course of psychotherapy. It seems to the writer that this paper may 
also be considered as a minor contribution to the diagnostic value of at least one 
characteristic of human figure drawings. 


THE SETTING 


A twenty-year-old male began an extended series of psychotherapeutic inter- 
views in the middle of his junior year in college. There were none of the frequent 


1This, of course, does not pertain to the use of play, art, and drama; these techniques may be 
considered, according to their use, as projective techniques or therapeutic techniques. 
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academic complications for he was a brilliant and efficient student who was elected 
to Phi Beta Kappa during this same year. If diagnostic labels were to be employed, 
the term obsessive-compulsive could not possibly be omitted, for this highly effective 
young man scheduled his activities on college vacations as rigorously as he planned 
his college course work—and he abided by his schedules! “‘Rational,’’ “logical,” and 
“intellectual” were his favorite words—and defenses. Additionally, it should be 
mentioned that his history was marked by a number of homosexual relationships. 

A series of nearly thirty interviews throughout a semester was interrupted for 
sixteen weeks by the college summer vacation. Until this lengthy interruption oc- 
curred, therapy had been progressing in a painfully slow manner, but nonetheless pro- 
gressing. Dynamic material of some depth had occupied a considerable portion of the 
therapeutic hours and some significant, if small, changes took place in attitude and 
behavior. Homosexual activity had ceased and there was some loosening up here and 
there of his many rigidities. 

However, when the interviews were resumed in the fall, weeks passed without 
any apparent therapeutic progress. Detailed accounts of the vicissitudes of a con- 
tinuing friendship made during the summer occupied seemingly endless hours of 
interviews. Finally, in the hope of redirecting the patient’s attention to less extern- 
alized matters, the therapist introduced the draw-a-person technique into the latter 
part of one of the interviews. 


INTERVIEW MATERIAL 
Figures 1 and 2 reproduce the drawings obtained during this hour.” 


| 


a “ne 


Fiaure 2 


The following verbatim transcription presents the patient’s comments while 
executing the drawings and his most revealing subsequent interpretations and 
associations. 


*The original drawings, only faintly sketched for the most part, have been slightly darkened for 
better reproduction. 
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Therapist. Would you draw a picture of a human being? 
Patient. (Laughs) Well, in elementary school and in high school, there were two sub- 


T. 
F. 


jects with which I used to come home with E’s on my report card—penmanship 
and drawing. That was something I could never do. 

I'll not ask you to write, just to draw a picture. 

I haven’t done this since . . . my niece could do a better job (erases) (laughs). 
I hope no one ever sees this. Let’s see, this is supposed to be a profile. I think it 
would be easier if I drew a (erases) .. . 

Turn the sheet over if you like. 

(Turns paper over and starts new drawing) In elementary school or in high 
school—I guess it was in elementary school—the fellow who sat next to me used 
to do my drawing and I used to do his math. I think this is the most difficult 
thing . . . (erases) . . . fingers, modern art. (Last phrase unclear) 

Pardon? 

I said, modern art, and that is supposed to be a human being. 

Fine. Now I want you to draw a picture of the opposite sex. 


Is that supposed to have a sex? (points to Figure 1) I guess that would be a man. 
(He starts to draw a profile again with much erasure) I’m trying to do a profile 
again (turns sheet over) ... (laughs) . . . (erases) A little out of proportion. Looks 
the same. This is, as I said a little while ago, not my field. I. . . find that this 
was something 1 could never do—was drawing anything that looked like any- 
thing; it was something I always wanted to do. My father, on the other hand, 
draws very well—not very well—at least he’s able to do it. And drawing seemed 
to me to be a... . skill of some sort which was similar to athletics possibly in 
some ways. 

What sort of a person is that one? (pointing to Figure 1) 


That looks like something out of Buck Rogers—or a robot. Certainly it looks 
well built, broad shouldered, muscular, I guess. It looks like something a two- 
year-old drew. It always seemed to me that something you could do with your 
hands, a skill, was more worthwhile than something you thought about or came 
to through thinking. When I was young, I guess, oh, five or six or so, I used to 
be a wonderful mechanic and very handy with tools. I used to take every- 
thing in the house apart, and frequently not put it together again. I used to 
know the uses of all the tools. My mother tells me that my father was quite 
sure I was going to be an engineer, and very soon that interest and skill disap- 
peared, although some of the interest was still there. Nevertheless, I... in 
elementary school, we had shop and I wasn’t any great carpenter or anything. 
I had difficulty in getting something straight, or planing an edge properly or 
something. 

Can you tell me more about what kind of a person that is? 


The person doesn’t look very smart, he looks sleepy, I guess. I. . . think that 
what comment I could make about both of these people—they both seem sex- 
less. The only thing that differentiates these two—this one’s (Figure 2) smaller 
than that one (Figure 1), which would possibly make me think of this as a 
woman (Figure 2), and this one’s a man—has broad shoulders. That’s supposed 
to be curly hair on that one (points to Figure 2) but it doesn’t look like it. Looks 
like she needs a shave. But certainly you couldn’t tell that it was a woman, 
looks more like a little boy. I think that if I could draw, I would have possibly 
done something else. In drawing a woman, I think if I could draw, I would have 
attempted to draw a profile which would make it more obvious that what_I,was 
drawing was a woman—than something like this. 

It is hard to tell which is a man and which is a woman because you have a’defect 
in drawing. : 
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That would seem like an awfully stupid reason. It... I could say that sometimes 
I find it very difficult to tell whether I was a man or a woman, and that difficulty 
remains until today. When I find that in certain situations, the role I should be 
playing I could play more accurately as a woman than asa man. One of those 
situations occurred to me the other day—I remember wanting to remember to 
mention it. I guess it slipped my mind. 

Both of these characters look to you to be sexless. 


Yeh . . . Sometimes I’ve thought that if people were actually sexless, that would 
possibly be the easiest way to (record unclear) but on the other hand I some- 
times thought that if I could act the same way toward people of both sexes, that 
would make things much easier . . . As I’ve said it has always been very hard for 
me to understand why it wasn’t possible or permissible to have feelings of 
affection toward a male as it was to have those same feelings toward a female 
but... although it is (word unclear) I certainly would say it was possible to have 
those feelings but it isn’t possible to express them the same way toward both 
eople. 
There is a complete denial of sexuality in these drawings. 


Ummm—the only way in the drawings I attempted to indicate the figure’s sex 
was the curly hair and I was going to try to give her better-formed legs . . . but 
I couldn’t do it. I still would possibly be inclined to say that it might be a defect 
in my drawing ability. I said if I were able to draw better, I might have drawn 
something else. 

You feel that if you could draw better it might have been possible to, if you had 
more skill or talent in drawing, it might have been possible to sketch in more 
definitely indications of sex. 


... Yeh—I think that, as I said a second ago, I would have drawn the female in 
profile, if I was able to draw... but... 


Had you been able to draw a profile it would have been more definitely a woman. 
I think it would have looked more like a woman I think that in my mind I 
can see what I would like to draw. I can see the differences in different pictures 
but when it comes to putting it on paper, it’s not possible. 

You can’t act out your feelings. 


... Well, again I’ve always thought of drawing as a skill or something that would 
have about as much connection with my feelings as the ability to sew or to knit 
or to do something of that sort. 


I think that perhaps in looking at these drawings some of your comments have 
perhaps gone to the core of the problem here. Possibly the repressing of sexuality 
completely on the one hand versus acting the same toward men and women and 


perhaps even more, the most basic of all, is not knowing that you yourself are a 
man or a woman. 


... That... second problem which you mentioned has been one which I guess 
I’ve spoken about in the past, and one which has troubled me particularly on 
many, many occasions. I think that... 

It is not one that we’ve worked on much this year. We’ve managed to skirt 
around it pretty effectively. Pressing minor issues have prevented us from get- 
ting at that perhaps. 

I think that one of the reasons that the Army is. . . could possibly be appealing 
is that it would show quite clearly whether I was a man or a woman. 





DISCUSSION 


Perhaps the most immediately arresting characteristic of these drawings is that, 
in! the words of the patient, ‘‘they both seem sexless.” The patient’s subsequent ad- 
missions offer strong evidence for the diagnostic meaning of the drawing of figures 
in:which there is essentially no sexual differentiation. 


mo 








THE USE OF DRAWINGS OF THE HUMAN FIGURE 69 


However, for purposes of the present paper, the writer wishes to stress the 
effect upon the therapeutic process of employing this simple projective technique. 
After an essentially unproductive series of interviews, the therapeutic activity was 
once more directed toward a very fundamental problem in the patient; in fact, it 
might almost be said simply that therapeutic activity was reinstituted. In his own 
consideration of his drawings, he had brought clearly into the focus of therapy his 
basic confusion of sex identity. Although this confusion had been alluded to pre- 
viously, he had now raised this problem in a different context with new and unavoid- 
able implications. He had now made a start toward grasping the widespread rami- 
fications that this confusion was producing in his interpersonal relations. Thus, there 
had been established, and by the patient himself, a basis with a concrete point of 
reference for the subsequent interviews. 

In this one instance, the use of human figure drawings in the course of psycho- 
therapy was highly, if not dramatically, successful. The writer does not purport to 
imply that such a result is an invariant. However, the technique has been of value 
in other cases. With another student, to cite briefly a different kind of usefulness, 
the drawings produced were extremely simple stick figures. It was suggested by the 
therapist that these drawings expressed compliance marked by minimal cooperation. 
This led directly to the patient’s expression of annoyance at being requested to per- 
form such a task, to a brief discussion of the similarity of the therapist’s manner to 
that of the student’s father, and, finally, to a profitable examination of the patient- 
therapist relationship. 


CONCLUSION 


Generalization from the single case is more than hazardous. However, the 
writer feels that there is ample evidence here to suggest, at the very least, considera- 
tion of using this projective technique within the therapeutic framework. This sug- 


gestion has its historical antecedents in the literature referred to in the introductory 
section of this paper. Although one may surmise that such a therapeutic use of draw- 
ings will be more limited in its effectiveness than the use of the TAT, it offers to be 
adequate to some of the problems in the course of therapy. Furthermore, the present 
technique has its own advantage. Since it is less time consuming in administration 
and since it obviates the necessity of preparing a transcription for the client, its use 
is more flexible within the interview. 
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SELF-PERCEPTIONS IN STUTTERING 
SEYMOUR L. ZELEN, * JOSEPH G. SHEEHAN AND JAMES F. T. BUGENTAL 
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PROBLEM 


A fundamental goal in the treatment of stuttering is the alteration of the 
stutterer’s attitudes and perception of himself. Bryngelson®), Johnson®: ®, and 
Van Riper“*: “ especially have stressed this point, finding the stutterer’s attitude 
responsible for perpetuating the behavior. It is with an awareness of the importance 
of self-attitudes of the stutterer that the present study was undertaken. 

Stutterers are customarily referred to and treated as handicapped persons. 
How much handicaps in general and stuttering specifically influence the self-concept 
of the individual, and thus behavior, is of major clinical import. A previous study 
by Sheehan and Zelen“? on the level of aspiration of stutterers indicated that this 
self-concept has widespread effects. Therefore, it is most important to have some 
objective criteria of what the stutterer’s self-percepts are, as well as some objective 
standards of comparison between the self-percepts of the stutterer and the normal 
speaker. Fiedler and Wepman“? recently explored the self-concepts of ten stutter- 
ers employing the Q technique, finding no characteristic differences from non- 
stutterers. 

METHOD 


Using a more phenomenological and direct approach, the W-A-Y technique’, 
the present study examined the self-perceptions of thirty stutterers. This group was 
then compared with a control group of 160 (all normal speakers) used in previous 
studies“: ?), 


In the W-A-Y technique the subject is told: 


“T don’t know you very well, and so I wonder if you would do something 
that will help me to know you better. Take this piece of paper and write 3 
answers to the question I will ask you. Remember, write 3 answers to the one 
question. You may write anything you wish—words, phrases, sentences—so 
long as you satisfy yourself that you have answered the question three times. 
The question is ‘Who Are You?’ ” 


The test situation was an early meeting of two stutterers’ therapy groups. The 
question ‘Who are you?’ was asked of each group, and then the authors circulated 
among the subjects to assist with individual difficulties in responding. This aid 
simply consisted of restating and clarifying the directions. It should be noted that 
the authors were aware that this was a “classroom” type of situation, and that the 
structuring of segregation into this particular group might tend to create a set to 
have the self-perception “‘stutterer’. Realizing this, they treated the response 
“‘stutterer’ as merely another form of the group membership category. 

This leads to a statement of categories or the dimensions of analysis of the 
responses. There were eleven in all: 

Name 

Familial relationship 
Social positions of prestige 
Occupation 

Age 


Sex 
Group membership, including stutterer 
Unit: pointing out uniqueness of the respondent 
(e.g., myself, me, an individual) 
Positive affect 
Negative affect 
Other—responses which do not fit into the other categories 
(e.g., sports fan, athletic, 5 ft. 9 inches tall) 


*Now at the Child Welfare Research Station, State University of Iowa. 
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RESULTS 


When the stutterers as a group were compared with the non-stuttering group, 
an over-all F ratio was obtained and found to be significant at the 5% level. In- 
dividually the only difference significant at the 1% level of confidence was on the 
“age” dimension, where the “controls” tended to respond with their age more than 
the stutterers. 

The categories of self-percepts which differed at the 5% level of confidence were 
(a) Positive affect, with the stutterers showing more positive affect percepts than the 
normal core group; (b) Unit, with the normal group giving a significantly higher 
amount of unit percepts than the stutterers; (c) Sex, with the core group having more 
self-percepts in the sex-identification dimension; and (d) Group membership. The 
stuttering group responded with significantly more self-percepts involving group 
membership. However, when the percept “‘stutterer’’ is withheld from this category, 
the difference is not significant. 

The comparison of the male and the female stutterers yielded no significant 
differences, but a comparison of the stutterers’ groups divided on the basis of success- 
ful or unsuccessful termination of the course of treatment indicated only one sig- 
nificant difference: Social status, in which at the 5% level the more successful in- 
dividuals tend to have a significantly greater number of self-percepts. 

The Negative affect dimension yielded a difference between the successful and 
unsuccessful stutterers of better than 10% level. While this difference cannot be 
called significant, still it does serve to call attention to the possibility that the in- 
dividuals who were able to cope successfully with their stuttering may tend to have 
less negative feelings about themselves than those stutterers who failed to meet 
adequately the challenge of this therapy. 

The results of the present study are not to be thought of as contradictory to the 
findings of Fiedler and Wepman, but merely different. Although the differences re- 


ported here would not have appeared had we used 10 cases instead of 30, it should be 
pointed out that the dissimilarities in procedure and method of measurement are so 
great that the results cannot be directly compared. 


Discussion 


On 5 dimensions—age, positive affect, sex, unit, and group membership—our 
stutterers differed significantly from normal speakers in their self-percepts. Sur- 
prisingly, these stutterers had more positive feelings about themselves than the core 
group of normal speakers. This finding can be interpreted in either of two ways: 
(1) that stutterers tend to overcompensate for their feelings of inferiority and so 
protect themselves with a halo of positive feeling; (2) more fundamentally per- 
haps, that these particular stutterers, having decided to help themselves through 
psychotherapy, had resolved a major conflict in their phenomenal field and thus had 
more positive feelings about themselves. Unit, age, and sez self-percepts were lower 
in stutterers, and this may be interpreted as being a function of the stutterer’s lessen- 
ed concern with a census-type perceptual orientation and having instead a height- 
ened orientation to his problems. Thus, stutterers have an increased responsiveness 
on the positive-affect dimension and the group-membership dimension, which in- 
cluded the percept ‘‘stutterer’’. 

_ Rogers“, Raimy ®: , Snyder and others have demonstrated that positive 
feelings increase and negative feelings decrease with successful psychotherapy, a 
finding with which the present results are in agreement. Since the successful stutter- 
ers tended to have fewer original negative-affect self-percepts, it is suggested that the 
W-A-Y technique might become useful as part of a battery for predicting response 
to therapy. 

The significantly greater frequency of Social Status relationships in the self- 
percepts of successful patients provides an interesting suggestion of what might be 
one of the bases for favorable response to group treatment. Those patients who have 
strong needs for social status seem to have a greater capacity for group identification 
and thus are able to utilize this by involving themselves more deeply in the group. 
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SUMMARY AND CONCLUSIONS 


The reported self-percepts of 30 stutterers were studied by means of the W-A-Y 
technique. Comparison of the stutterers with 160 non-stutterers yielded signifi- 
cant differences on these dimensions: age, positive affect, unit, sex, and group identi- 
fication. Use of the last-named category was significantly greater for the stutterers 
because it included this self-percept—stutterer. 

The use of the W-A-Y technique for predicting the stutterer’s response to treat- 
ment is suggested by the observed tendency of the eventually successful patients to 
start therapy with fewer negative self-percepts. Possible implications of the results 
for group therapy with stutterers are discussed. 
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THE DIAGNOSTIC VALUE OF THE ROSENZWEIG P-F STUDY 
ROBERT L. BROWN AND OLIVER L. LACEY 


Furman University University of Alabama 


PROBLEM 


The Rosenzweig Picture-Frustration Study, described elsewhere“: 4 5 ©, has 
gained some degree of popularity in the study of frustration at the human level. This 
projective technique was designed to assess reactions to frustration and is based on 
the author’s observations and theory of frustration®: *. According to Rosenzweig: 
“The mental disorder best illustrating each type of reaction would . . . be as follows: 
for the extrapunitive, paranoid complaints; for the intropunitive, compulsive and 
obsessional ones; and for the impunitive, hysterical ones’. “: P. 588) He has also given 
sample cases“) of the patterns of responses found among normals and psychotics. 
Clearly, it is assumed that the technique has diagnostic value—that it differentiates 
in some way between those who are mentally and emotionally disturbed and those 
who are still able to function efficiently in society. 

Does the Rosenzweig Picture-Frustration Study differentiate between various 
clinically labeled groups and, if so, does it differentiate to the extent that it can be 
used as a device in determining the diagnosis of unlabeled individuals? 
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METHOD 


The P-F Study was administered to 108 subjects who were divided into three 
equal groups; a group of normal subjects, a group of chronic alcoholics, and a group 
of paranoid schizophrenics; with an equal number of males and females in each group. 
The normal groups were made up of university students and hospital personnel. The 
female alcoholic group was comprised of new members in Alcoholics Anonymous, 
while the male alcoholics were hospital patients diagnosed as ‘chronic alcoholic 
without psychosis”. The subjects in the paranoid schizophrenic groups were selected 
by hospital personnel, on the basis of ‘‘clear diagnosis”, recency of diagnosis, age, and 
education. All groups ranged in age from twenty to sixty years and in education 
from seven to seventeen years. The responses were scored according to the deter- 
minants set forth in the manual, and the numerical scores (percentages) were 
subjected to statistical analysis. 


RESULTS 


There are three possible ways in which this test may be found to differentiate 
between the groups. (1) By the direction of aggression expressed; that is, whether 
it is extrapunitive, intropunitive, or impunitive. (2) By the type of response given; 
that is, whether it is ego-defensive, or is dominated by the obstacle, or shows a need 
to find a solution to the frustrating problem. (3) By the Group Conformity Rating 
which is obtained by comparing each record with a pattern of responses which has 
been found to occur in a sufficiently high number of records to permit its adoption 
as a norm“; ®, 

It will be recalled that the basic question raised is twofold: 1) does the P-F 
Study differentiate between variously labeled clinical groups, and, 2) does it differ- 
entiate between these groups to the extent that it can be used as a device in de- 
termining the diagnosis of unlabeled individuals? 

The first part of the question will be considered by examining the results of this 
investigation (reported in Table 2) with special reference to the significance of the F 


TaBLe 1. MeAN Scores on P-F Srupy ror Srx Groups or EIGHTEEN Sussects EacuH AND 
STANDARD DEVIATIONS OF THE INDIVIDUAL SCORES 








Groups 
Scores 





Alcoholic Paranoid 





Direction of Aggression M F M ° F 


Extrapunitive 35.3 36.5 36.4 39.3 
(11.1) (15.6) (12.6) ‘ (16.5) 


Intropunitive 30.3 29.7 af , 27.8 
( 6.4) (10.3) .5) ; ( 9.8) 


Impunitive 34.4 33.8 5.9 : ‘ 32.9 
( 7.4) (10.4) ; : (10.2) 


Type of Response 
Clete Temiaanes s 21.3 
( 9.3) 


Ego-Defensive ‘ 51.0 
(11.5) 


Need-Persistence 2 27.6 
(10.3) 


Group Conformity 
Rating 


69.3 
(10.3) 
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Tasue 2. Resvuts oF THE F Test on Scores DERIVED FROM THE P-F Srupy on Srx Groups or 
EIGHTEEN Suspsects Eacu 








D.F. E I M 0-D E-D N-P GCR 
Total 107 





Between Group 5 | 5.50** : ‘ 3.27** 
Within Group 
Between Sex 1] 12.42** ; F 12.13** 
Between Classification 2{| 5.42** 1.87 
Discrepance 2 2.36 ‘ ‘ ‘ .26 





























*Significant at the 5% level 
**Significant at or beyond the 1% level 


scores between classifications, for the direction of aggression, type of response, and 
Group Conformity Rating. Significant differences between classifications in the 
percentages of extrapunitive and impunitive responses were found, both being sig- 
nificant beyond the one percent level. The only other significant class difference was 
in the percentage of obstacle dominance responses, which was significant at the five 
per cent level. In each case, the difference between sexes is of greater significance 
than the difference between classification. An examination of the mean scores (Table 
1) for the groups shows that the source of the significant difference for the extra- 
punitive and impunitive responses lies in the deviations of the alcoholic female and 
paranoid male groups, with all other scores clustering closely around the normal. 
As to the significance of obstacle dominance responses (at 5% level), the mean scores 
show that the normal females and the paranoid females are slightly above the 
average for the groups, but are not more than one standard deviation of the raw 
scores away from the average. The Group Conformity Rating failed completely to 
differentiate between the groups in this study. 

A quick survey of Table 1 immediately answers the second part of the basic 
question—does the P-F Study differentiate between various clinically labeled groups 
of persons to the extent that it can be used as a device for determining the diagnosis 
of unlabeled persons. The difference between the mean of any diagnostic group and 


the corresponding mean for any normal group is no greater than one standard de- 
viation of the individual scores. 


Discussion 


The results of this investigation show that the test has some discriminating 
capacity. However, the three significant differences which were found, do not ap- 
pear impressive since there were no clear-cut differences between any one group and 
any other group. In view of this fact, we must conclude that these trends of 
— are not discriminating enough to justify the use of the test for individual 

jagnosis. 

Furthermore, it seems evident that the theoretical background of the test is 
not supported. As indicated earlier, Rosenzweig has suggested that paranoid com- 
plaints should be characterized by high extrapunitive scores, that compulsive and 
obsessional ones should be high in intropunitive scores, and that hysterical com- 
plaints should rank high in impunitive scores. In this investigation there is no group 
of hysterical disorders nor of compulsive disorders, unless the chronic alcoholic 
group may be considered as falling into the latter classification. In any case, the 
chronic alcoholic group did not score significantly higher than the normal or control 
group in either intropunitive or impunitive responses. The paranoid group, accord- 
ing to the theoretical prediction, should have scored higher on extrapunitive response 
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than either of the other groups. Actually, however, they gave a lower percentage of 
extrapunitive responses than did the normal and alcoholic groups. The paranoid 
males scored considerably lower than the normal males in extrapunitive responses 
and the paranoid females, although averaging a little higher, did not differ signifi- 
cantly from the normal females. 


SUMMARY AND CONCLUSIONS 


The P-F Study was administered to 108 subjects who fell into three equal 
groups; normals, chronic alcoholics, and paranoid schizophrenics; with an equal 
number of males and females in each group. 


The results permit the following conclusions: 


1. The test has some discriminatory capacity. However, the over-lapping of 
the group scoring is very great. 

2. The differences between groups are not large enough to justify the use of 
the test in individual diagnosis. The difference between the mean for any diagnostic 
group and the corresponding mean for any normal group is no greater than one 
standard deviation of the individual scores. 

3. The findings fail to confirm the theoretical validity of the test. 


4. The P-F Study should not be used routinely in clinical diagnosis until furth- 
er exploratory and experimental work yields an adequate validation. 
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BACKGROUND DaTA 


It is most unusual that test results are available on a non-psychotic patient, 
both prior and subsequent to serious brain injury. A 22-year old patient presented 
himself for treatment at the Veterans Administration Mental Hygiene Clinic in 1948 
complaining of numbness and paralysis of his left arm. Previously a physical ex- 
amination revealed no organic disease of the nervous system, and the E. E. G. was 
within normal range. This negative finding was confirmed at the time of his accept- 
ance for treatment at the clinic. At that time, he was eligible for compensation for 
psychoneurosis, anxiety type, but had waived compensation in order to join the 


This paper was read at the annual meeting of the Colorado-Wyoming Academy of Science at 
Colorado rado Springs Colorado, May 1, 1953. 
nks are due to Dr. Johann R. Marx, Assistant Chief, Denver Veterans Administration 


ag tal Hygiene Clinic for reviewing the medical material on this patient; and to Drs. Howard Siple 
- a of the Veterans Administration Hospital, Denver, for making some of the test 
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naval reserve. He was described by his therapist as having marked mood swings, 
being highly suggestible, immature and impulsive. His behavior was described as 
being loud and boisterous, obviously an attention getting device. After five treat- 
ment interviews, mainly supportive in nature, his arm returned to normal function- 
ing. His therapist attributed the change to his extreme suggestibility. However, in 
the course of treatment, he was able to discuss his hostility towards his father, to- 
wards his employer and about his reaction to military experiences. The closing diag- 
nosis was conversion reaction. While in treatment he was given the Rorschach and 
Wechsler-Bellevue. 

Subsequently, the patient attended college for one and a half years. His ad- 
justment was adequate, although he complained of nervousness and had some diffi- 
culty with school work. On a routine compensation examination, his behavior was 
described in the same terms as when he was a clinic patient. However, he had no 
complaints of paralysis. 

The patient was called back to active duty by the Navy and had some service 
in the Korean area. In April, 1951, the veteran suffered a frontal skull fracture and 
frontal lobe injury in an automobile accident. He was completely unconscious for 
several days and consciousness was regained gradually. Eight days after his injury, 
he was able to respond only to a slight degree. He had a complete right hemiparesis 
and pronounced aphasia. 

His recovery continued to be gradual. Seven weeks after his accident, the 
aphasia was gradually subsiding but it was still difficult for the patient to choose the 
correct words. At that time, there was a marked right spastic hemiplegia, only the 
shoulder and arm muscles could be moved. The muscles of the fingers, leg and foot 
were nonfunctioning. There was also a marked lessened sensitivity to pain. 

The next report is dated 7 months subsequent to the injury. It was noted that 
movement was restored to the right extremities but that there was marked inco- 
ordination with moderate atrophy of the muscles. Reflexes were hyperactive on the 
right side. The patient was described as mildly aphasic, mildly euphoric with a 
marked loss of recent memory. The diagnosis of encephalopathy was confirmed. 
The E. E. G. was abnormal, showing an irregular slowing which would be consistent 
with a diagnosis of convulsive disorder. The patient was put on anticonvulsive medi- 
cation as a prophylactic measure since there were no reported seizures. During the 
period of hospitalization, the patient was given the Wechsler-Bellevue Scale, and the 
Wechsler Memory Scale, among others. 

The patient subsequently returned to treatment at the Veterans Administration 
Mental Hygiene Clinic for psychotherapy and anticonvulsive medication. He com- 
plained of headaches, and inability to move his right arm above the level of his 
shoulder. It should be remembered that he had complained of an inability to move 
his left arm at the time of previous treatment. He also complained of a memory de- 
fect and difficulty in speech. He was described as being very immature, exhibiting 
flighty behavior, being quite boisterous and emotionally very labile. Essentially, 
his behavior appeared to be an exaggeration of the type described prior to his injury. 
The patient has been seen in treatment for 12 months, during which time he received 
both psychotherapy and anticonvulsive medication. His mother was also seen three 
times and she is described as a very dominating and overprotective woman. The 
patient also had been referred to the Denver University Speech and Hearing Clinic 
and received considerable help from this source for his aphasic condition. 

This patient has shown considerable improvement; his flighty behavior, silly 
talk, and boisterousness have all diminished considerably. His complaints of mem- 
ory loss, aphasia and inability to move his arm have disappeared. He obtained a 
position and has shown a sense of responsibility and a capacity for work which did 
not exist prior to his injury. He is still being carried on in therapy, mainly for the 
anticonvulsive medication and supportive psychotherapeutic interviews. The 
pes was tested during this period of treatment in March, 1952, 11 months after 

is injury. 
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Test RESULTS 


Wechsler-Bellevue Intelligence Scale. The weighted scores on each of the Wechsler- 
Bellevue subtests, as well as the Verbal IQ, Performance IQ, Total IQ and Deteriora- 
tion Quotent (DQ) are presented in Table 1. 


TABLE 1. WECHSLER-BELLEVUE WEIGHTED ScorEs AND IQ’s BEFORE AND AFTER HEAD INJURY 
W-B 


Subtests and i 33 Months 7 Months 11 Months 
IQ Scores i Prior After 











ll 








46 
22% 


Quantitative analysis of each of these tests, taken individually, would not indicate 
organic brain damage. The only testing which gave a DQ which might be suspicious 
was 7 months after his accident. But even here, the DQ is only 22%; a finding not 
unusual in the clinic “non-organic” population“). At this time, the patient’s profile 
revealed three of Wechsler’s “Organic Signs”, but neither Block Design nor Digit 
Symbol differed significantly from the performance mean score. Also, the Perform- 
ance scale IQ is still much higher than the Verbal scale IQ, again contrary to Wech- 
sler’s expectations. 

Eleven months after the injury, the only “organic” sign is the low Digit Symbol 
test score. The DQ is negative, and Block Design is the highest of all subtests. The 
Verbal IQ is still much lower than the Performance IQ. 

The Reynell ® index for organicity was computed for each test. The index was 
negative for the first and third tests, but positive (14 points) for the test at 7 months. 
The Hewson“? ratios were also computed. These were remarkably consistent; for 
each of the tests the same two of the eight ratios were positive. Apparently the brain 
injury had no effect on these ratios. 

However, while analyses of the individual tests were not revealing, a comparison 
of the tests is most rewarding. Comparing the test results at 7 months after injury 
with the test prior to injury, we find that 9 of the 11 subtest scores have changed in 
the direction expected by the Wechsler signs. There is a marked decrement in 
capacity for memory (D), abstraction (S), perception and synthesizing (BD), and 
new learning (DS). The improvement in Vocabulary and Picture Arrangement 
scores seems to be related, in part, at least, to the patient’s attitude and co-operation, 
since on the first test he manifested behavior consistent with the personality picture 
described above. On the second testing, he was extremely cooperative. In terms of 
1Q, it can be noted that his performance IQ is approximately the same. On the other 
hand, there is a marked decrease in the verbal IQ which is consistent with the apha- 
sia. In addition, the sum of the ‘Hold”’ tests has remained consistent, whereas the 
“Don’t Hold” tests have dropped 14 points. 

In contrast to the comparison of the first and second testings, the first and third 
testings show little consistency with ‘‘organic’’ expectations. Memory capacity (D) 





Don’t Hold 
D. Q. 
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and new learning (DS) still show some decrement, although less than on the second 
test. Arithmetic and Block Design, which were lower on the second than the first 
test, now surpass the scores prior to injury. A comparison of the second with the 
third test indicates that most of the improvement was found in the ‘Don’t Hold” 
tests. It is likely that some of these increases are due, in part, to practice effect since 
there was only a 4 months interval between the last tests. On the other hand, the 
better performance may be due to a reduction of the effects of the damage and return 
to the pre-injury level. This interpretation of the change would be consistent with 
the clinical impressions at the time. We were aware of the possibility that there 
might be some practice effect on the last testing; however, the advantages of com- 
parison on the same material outweighed the disadvantages. 


Rorschach. The patient was given the Rorschach only at the first and third testing. 
The scoring and tabulation are presented in Table 2. 


TABLE 2. Rorscuacu ScoRING AND TABULATION 








Card No. 33 Months Prior 11 Months After 
I WF+AP WF+AP 
DF + Hd 
DF-A 
WF+A DF+AdP 


Rejects DM+HP 
DF+Cg 


WCF-A Rejects 
WFM+AP WF+AP 
Rejects 





Rejects 
Rejects 


Rejects 














It will be noted that this veteran gave only four responses the first time and eleven 
the second time he took the Rorschach. Detailed perusal of the second record failed 
to reveal the usual signs of the effect of the accident; neither Piotrowski’s“) nor 


Hughes’? lists of “organic” signs gave any clue to the fact that this man had had a 
frontal lobe injury within the year. 
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At the last testing, the Rorschach gives evidence of the patient’s attempt at 
controlling his behavior and channeling his behavior into more acceptable avenues. 
Clinically, this striving for control was also observed. There seemed to be a wavering 
between his boisterousness at this time and his rigid attempts to control himself. 


Wechsler Memory Scale. The Wechsler Memory Scale was also given 7 months and 
11 months after the accident. In contrast to the Wechsler-Bellevue results, there was 
little change during this period. The Memory Quotient went up from 90 to 94. The 
change was due almost entirely to the increased digit span which was also in evidence 
on the Wechsler-Bellevue. 


Discussion AND SUMMARY 


The literature in this field is not at all helpful, primarily because the subjects 
used are either the hopelessly psychotic, the very severe neurotics or patients with 
intractable pain. Usually the opportunities for adequate preoperative testing are 
obviously extremely limited. Struckett quotes 13 studies in this field: 4 reported 
a loss in intelligence test scores, 4 reported no gross decrease, 2 reported an increase, 
while 3 observed no significant difference. Reasons for the lack of agreement can 
readily be found including: the selection of subjects, operative approaches, absence 
of adequate controls, ete. 

Many studies, including the Columbia-Greystone“ and the Struckett studies 
indicate that the greatest loss occurs immediately after the operation or injury, and 
that in 6 months or so the loss is negligable. Our patient did not follow this pattern. 
Of course, we do not have tests for the period immediately following the injury. But 
at 7 months he still showed very pronounced losses. It was only at 11 months after 
injury that he had surpassed his previous level. 

This paper emphasizes the fact that the effects of organic brain damage do not 
always show up in the expected manner. We cannot assume that all types of injury 
to the brain will result in similar test behavior. Nevertheless, the clinician often 
functions as if this were true, and as if the effects were permanent. Yet, there are 
many studies, in both the human and animal field which indicate that much brain 
tissue can be lost without having a marked prolonged effect. However, there ap- 
parently are more subtle cues which the psychologist uses which have not yet been 
quantified to a degree that can be used as “‘signs.”’ 

Another major difficulty is the fact that we usually do not know the person’s 
pre-injury level, either in personality or intellectual functioning. That is, we cannot 
assume that every patient is normal to start with, and that the deviations will all be 
from the same level. In this particular case, the patient did change in the expected 
manner on the Wechsler-Bellevue but the only way we could have known about it 
was by having the original test. On the Wechsler-Memory Scale, while there is a 
slight increase from the second to the third test, the patient’s starting point is un- 
known. While the Rorschach was consistent with the clinical impression of the 
personality, the test did not contribute to the diagnosis of organicity. However, 
other tests at the last testing along with behavioral cues did indicate the possibility 
of organic brain damage. 
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RORSCHACH PATTERN ANALYSIS IN SCHIZOPHRENIA: A CROSS- 
VALIDATION STUDY! 


EARL 8. TAULBEE and 
Norfolk State Hospital, Norfolk, Nebraska 


BOYD D. SISSON 
University of Nebraska 


PROBLEM 


In a recent publication Thiesen “? reported five patterns of Rorschach variables 
which he found to be associated with schizophrenic patients and nonoccurring or 
extremely rare in normal subjects. While these patterns significantly differentiated 
the two groups, it was stated that they must be regarded as tentative inasmuch as 
the groups on which the patterns were tested were the same groups from which they 
were derived, and therefore needed to be confirmed on new groups. The following 
study reports the results of a cross-validation of the applicability of the patterns using 
ing new patient and control groups. 


TABLE 1. PaTTeRNs ASSOCIATED WITH SCHIZOPHRENIA 








Thiesen Present Study 


Descrip- Controls | Patients Controls | Patients 
tion (N 157) (N 60) (N 157) (N 62) 











An: High! 12 3 , 0 
Sex: High 
F+%:Low 
Z Score: 
Low 
FC+: Low 
M: Low 
A%: Low 
F+%:Low 
¥: Low 
A%: Low 
DW: High 
FC+: Low 









































*Because of the low cell frequencies involved, Fisher’s “exact test’’ was used for determining the 
probabilities. 


1. An: High (3 and above—excluding sex responses) 
Sex: High (1 and above) 
F+%: Low (0-68.9) 
Z score: Low (0-7.5) 
FC+: Low (0) 
M: Low (0) 
3 Lg (0-33.9) 
0-4) 


DW. High Q and above) 


1From the Veterans Administration Hospital, Knoxville, Iowa. 
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SUBJECTS 


Subjects used in this study were 62 hospitalized adult male schizophrenic pa- 
tients, so diagnosed by the psychiatric staff of Knoxville, Iowa, Veterans Admin- 
istration Hospital. The control group consisted of 157 ‘ ‘normal” men and women 
from a small midwestern community,’ which is located in the same general geograph- 
ical locale from which the patients came. The age range for the patient group was 
ee pe age 28, SD 5.16; for the control group the range was 20-45, mean age 
33.8, 6.89. 


PROCEDURE 


The Rorschach test was individually administered to all subjects in both groups 
and scored according to the revised Beck system“). Each record was examined for 
the presence of one or more of the five patterns described in Table 1. Derivation and 
further description of these patterns are contained in the previous paper ®?. 


RESULTS 


The assumption of homogeneity with respect to total pattern occurrence for 
the two groups was tested by chi square, employing Yates’ correction for continuity, 
and rejected at the .001 level of confidence. 

With respect to the individual patterns it is shown in Table 1 that the dis- 
criminating efficiency of Patterns A, C, and D is confirmed, whereas Patterns B and 
E fail to meet the acceptable statistical criterion. 

In the present study only one multiple pattern, a C-D combination in the schizo- 
phrenic group, was obtained; in contrast, the earlier findings reveal that eight of the 
patient group gave two or more patterns. 

The patient group consisted entirely of male subjects, whereas the control group 
was composed of 103 females and 54 males. A chi square of .079 between sex and 
pattern occurrence, however, allows inference that pattern discrimination for one 
sex is not significantly larger than for the other. 


Discussion 


The authors are in agreement with the interpretation offered by Thiesen that 
Pattern A is suggestive of “somatic preoccupation and release from culturally de- 
termined inhibition,” which are not unlikely characteristics of a schizophrenic dis- 
turbance. The possibility that this pattern might be more common in selected popu- 
lations of specialized interests or sophistication in the absence of psychopathology, 
as mentioned in the earlier study, remains unanswered, but it did not occur in any 
of the present control group. Likewise, it is agreed that Pattern D suggests a dis- 
turbance in affective contact with the environment. The confirmation of Pattern C 
offers no additional explanation of the interrelationship among the factors compris- 
ing this pattern. 

Failure to confirm Patterns B and E may be attributed in part to the lack of 
homogeneity with respect to pattern occurrence in the two schizophrenic groups as 
evidenced by a chi square of 4.82 (P = .03). A suggested explanation for this differ- 
ence is that the present patients were new admissions whose psychoses were in early 
stages of development or recurrence, and one half of whom were diagnosed schizo- 
phrenic reaction, paranoid type. For this group the mean Wechsler-Bellevue IQ 
was 100.65 (obtained within two days of Rorschach administration), and the mean 
F+% was 72. Also, compared with Thiesen’s ‘‘normal’’ group, the present patient 
group obtained a significantly higher Z score and there were no significant differences 
between these two groups for A% and number of FC responses. While it is obvious 
that these factors are in opposition to some of those appearing in the patterns not 
validated, it is felt that their occurrence among schizophrenic patients where the 


*The records were taken from a larger study done under the supervision of Dr. Marshall R. 
Jones and supported by the University of Nebraska Research Council. 
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illness is not advanced and among those of the paranoid type is not uncommon. 


There were no significant differences between the two control groups for the presence 
of patterns. 


SUMMARY 


In a previous study Thiesen reported five Rorschach patterns associated with 
schizophrenia. The present study presents the results of a cross-validation based on 
a group of 62 adult schizophrenics and a group of 157 “normal’’ controls. Three of 
the patterns were found to be discriminating. A probable explanation is offered for 
the failure of the remaining patterns to be validated. 

The authors believe that these results lend support to Thiesen’s statement that 
when the patterns occur they may have clinical usefulness in early detection of a 


schizophrenic process or as an aid to determining the gravity of an initial psychotic 
episode. 
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A MECHANICAL SHORT-CUT FOR COMPUTING TETRACHORIC 
CORRELATIONS 


ANDERS SWEETLAND AND RALPH MASON DREGER 
Florida State University 


INTRODUCTION 


As research accumulates in clinical psychology, the complexity of clinical prob- 
lems becomes more and more apparent. In view of this complexity, more psycho- 
logists are looking towards factor analysis, as it seems to be a tool of sufficient stature 
to encompass the dimensions of clinical problems. 

At first glance factor analysis is discouraging: the computational procedures are 
complex and time consuming; but even more forbidding is the number of correlations 
involved. An innocent-appearing study involving twenty elements necessitates the 
computation of 190 correlations. Even the thought of this can be traumatic, particu- 
larly since most psychologists do not have IBM facilities. However, if the problem 
of mass production of correlations could be resolved, factor analysis might become 
accessible to many psychologists who have previously been discouraged from using it. 

In attempting to solve this problem we set up three desiderata: (1) That the 
method be inexpensive; (2) that it be reasonably economical of time; and, (3) that 
it be simple enough to be handled by a student assistant (who is frequently a college 
sophomore without mathematical sophistication.) 

To reduce computational time, the first consideration is generally that of using 
tetrachoric correlations. However, the reduction of the groupings from the number 
in a scatter diagram results in a loss of accuracy: as the number of intervals is re- 
duced the standard error increases. This objection can be surmounted by increasing 
the size of the sample, thus decreasing the error. But to increase the sample size in- 
volves additional clerical work, so the psychologist is back where he started. What is 


needed is a means of increasing sample size without an appreciable increase in clerical 
work. 
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PROCEDURE 


After formulating and rejecting several possibilities we decided to use punched 
cards for sorting of items and to weigh rather than count cards. (Any balance sensi- 
tive to 0.1 gram could be used.) After the cards were sorted into four categories as 
required for the tetrachoric r, each stack of cards was weighed and the weights used 
in the computations. A further reduction in the size of the standard errors was ob- 
tained by using a sorting division close to the mean of the sample for each item or 

test. 

The following is a description of the testing of the procedure. An attitude scale 
of fourteen items was used with each item subdivided into an 8-point scale. The scale 
was administered to 350 college students. A sample of 75 scales was tabulated. After 
means were computed for the fourteen items, a template was prepared which ex- 
posed only the items above the mean in each case. Each individual’s scores were 
then encoded on a 3” x 5’ card. These cards were made by clamping them between 
two boards and drilling 14 holes around the edges, spaced approximately 5/8 of an 
inch apart, 4 inch from the edge. Using a drill press speed of 1000 r.p. m., and 
counter-boring with a slightly larger drill (#3 drill counterbored with a #1 drill), 
we could obtain holes with practically no burring. Clean holes are necessary for ease 
in sorting. One corner of each card was clipped for purposes of alignment. 

In scoring, the student assistant placed the template on the scale sheet, and, 
with a scissors, clipped out the edge of the corresponding hole on the card of any 
checked item showing. These operations were rechecked by another student for 
accuracy. 

A knitting needle, made from a coat hanger, was used to sort the cards. The 
needle was first run through the item 1 holes and lifted up. Items that were answered 
“ves’’ (i.e., above the mean of item 1) fell to the table and the ‘‘no’s” were left hang- 
ing on the needle. With these two stacks separated, the same procedure was followed 
for item 2, each of the item 1 stacks being sub-divided into above mean and below 
mean of the item 2 groups. The four stacks of cards obtained (item 1, yes —2, yes; 
item 1, no —2, no; item 2, yes —item 1, no; item 2, no —1, yes) were weighed on the 
balance. The cards were restacked, and item 1 and item 3 sorted. Again the four 
stacks were weighed. A similar process was followed to obtain the 91 correlations. 


TABLE 1. DistTRIBUTION oF 90 CoRRELATIONS IN ToTaL Grams OBTAINED 
BY ADDING THE Four QUADRANTS 








Grams Frequency 


512 
511 
510 





* 


- 


SOSwmounwnnoonooceor 


_ 


CroowsH.S 


491 





*Might be reweighed for possible clerical errors 
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With these weights tabulated, the tetrad of cells (or quadrants) were added by rows 
and columns and the correlations computed using Hayes’®? charts. The Chesire- 
Saffir-Thurstone“? tables or the cosine pi method could also be employed. By use 
of the Hayes’ charts and a slide rule, computations take approximately two minutes. 

A check on the accuracy of this technique can be made by glancing at the table 
of total weights. Any aberrant. measures are immediately apparent. An idea of the 
accuracy of sorting can be seen in Table 1 which shows the total weights obtained by 
adding the four quadrants. The error is less than one per cent in 94% of the cases. 

In checking to see how well we had met the desiderata, a final ‘‘expense sheet”’ 
was compiled: (1) total expense was sixty cents for four hundred 3” x 5” cards, plus 
seventy-five cents per hour for the assistants; (2) the total time consumed (including 
preparation for correlations) was approximately forty minutes per correlation. 
(Since the experimental trial included necessary delays, we believe that the time could 
be halved by a more organized approach); (3) an average error resulted of less than 
one per cent. 

It is possible that the time might be further reduced by measuring the stacks of 
cards with a micrometer. We did not follow this practice as the 3” by 5” cards tend 
to distort in handling. The result is a ‘“‘spongy”’ stack which is not suitable for micro- 
meter measurements. Commercial cards (McBee, Keysort) would be much less 
prone to this shortcoming. 


Briefly, the procedure is: 
1. Determine means of items from sample of total N. 
Make template(s) cut out above the mean. 
Place template(s) on each scale, cut edges of holes where checked items 
show through. 
Stack all cards and divide into four stacks with needle. 
Weigh cards in each stack and record weights in 2 x 2 table. 


Repeat #4 and #5 for each item. 
Compute tetrachoric r’s by diagrams. 


SUMMARY 


A short cut for computing tetrachoric correlations involving the use of card 
sorting and weighing is described. Total cost for a sample involving 350 cases in- 
volving 91 correlations was less than one dollar for materials. The time was less than 
40 minutes per correlation and the reliability of scoring showed an error of less than 
one per cent. 
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THE LEARNING PLATEAU SEEN AS RESISTANCE IN INDIVIDUAL 
REMEDIAL TEACHING 


MONROE G. GOTTSEGEN AND GLORIA B. GOTTSEGEN 


Inter Agency Guidance Center, The Developmental School, 
Yonkers, N. Y. New York, N.Y. 


INTRODUCTION 


It is our experience in work with children referred for remedial reading, that one 
of the most significant occurrences which can act to negate achievement is the learn- 
ing plateau. This is most apparent in individual remedial sessions when the child 
possesses an emotional difficulty. However, even with the normal child, whose re- 
ferral seems essentially due to school failure of a non-neurotic nature, the learning 
plateau occurs to interfere with achievement“. The generalized blocking of the 
plateau may reflect the displacement of unconscious resistances seen as negative atti- 
tudes toward general as well as specific learning so that failure to progress can be 
better understood in terms of the emotional concomitants which underlie the struc- 
ture and relationships of the remedial hour”. 

It is generally accepted in individual remedial teaching that. a necessity exists 
for distinguishing between structure and permissiveness®?. The degree of structure 
within the learning situation is determined by the original diagnosis of the specific 
reading handicap, coupled with individual capacity for learning, and based upon the 
emotional strengths and weaknesses of the child. In working with the child, keeping 
specific goals in mind, it is necessary to plan carefully the selection and use of mater- 
ials. On the basis of the educational diagnosis made, the remedial teacher must allot 
time for work on specific areas necessary for correcting apparent disabilities, i.e., a 
typical remedial reading hour may consist of oral reading (10 minutes), work with re- 
lated word recognition techniques and games (15 minutes), comprehension exercises 
(10 minutes), and story reading (10 minutes). 

The time allotted to such activity is adhered to with a rather consistent degree 
of regularity. Permissiveness is shown during the activity undertaken in terms of the 
teacher-pupil relationship. It is of note that whether or not the remedial reading is 
being given while the child is undergoing psychiatric treatment, the therapeutically- 
oriented teacher nevertheless remains a teacher, and does not attempt to deal directly 
with observed attitudes and underlying affect. In the kind of remedial hour des- 
cribed, it is not unusual to find, after an initial learning spurt, a tendency for the 
child to block, reach a learning plateau, and then to regress. This seems to occur in 
spite of the attempt to vary materials within the hour, introduce new activities of a 
related nature, and thereby continually stimulate interest. 

When blocking initially appears, it begins to show itself as a fairly isolated 
function of one specific learning area. However, the tendency then is for the block- 
ing and related inefficient educational functioning to spread to the extent that it 
encompasses a majority of reading areas and skills. The child at the same time may 
be coming late or showing other forms of resistance of a more generalized nature. 
Upon conferring with parent and /or therapist, there may appear no overt indication 
upon which to base an explanation of the learning block. The plateau reached ap- 
pears as if out of nowhere and, if ignored, can retard learning for a considerable period 
of time. It is the duty of the remedial reading teacher to exert all effort in over- 
coming the learning block. In many cases, successful remedial tutoring must rest 
upon the basis of whether the impinging regressive phenomena are dealt with and 
overcome. 

After the initial spurt which may be due to expectations of quick progress on 
the part of the child, a wish to please parents and friends, or an initial fascination 
with the techniques and procedures useful in the hour, the novelty and fantasy ex- 
pectations of quick success begin to wear thin when confronted with the reality of a 
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learning situation which depends for its success upon productive achievement. The 
child is aware, no matter how permissive and accepting is his teacher, that he is 
expected to learn in the hour, and to apply effectively those skills and techniques 
mastered, to school and home situations. Whatever the attitudes originally inter- 
fering with productive learning in school, they are at this point partially displaced 
in the form of resistances to the remedial hour. Prior success within the reading hour, 
contrary to expectation, is not sufficiently supportive for overcoming the resulting 
jlateau. 

, A hostile child who has found it difficult to be directly aggressive in his relation- 
ship with parents and who has not adjusted in peer relationships may transfer the 
latent negative attitudes toward authority (as seen in failure to apply sufficient 
energy for academic achievement) to the remedial teacher. If, in continuing the 
above example, negativity appears as an expression of hostility in school endeavour, 
it will also make a similar appearance in the tutoring hour. It is important to realize 
that the permissiveness of the remedial teacher will encourage the display of the de- 
fensive mechanisms and increase tendencies to displace and project. Again, a 
youngster who is known to be passive and withdrawn, deriving major satisfactions 
from a wish-fulfilling fantasy, will dawdle, procrastinate, lose his place, and function 
with extensive inefficiency under similar circumstances. If the remedial teacher 
anxiously reacts with a lessening of permissiveness and a more rigid adherence to 
structure, he will create further anxiety in the child, rather than dispel it, and as a 
result totally destroy the opportunity for a reorientation toward new learning. If 
he ignores the resulting phenomena inherent in the seemingly unexplained failure 
to learn, and in the behavioral expression of resistances, he will find himself frustrated 
in his teaching and of little help to the child. The remedial teacher cannot ignore 
the presence of a resistance simply because the pupil may be receiving psychothera- 
peutic treatment elsewhere. The child himself basically desires that his behavior 
be dealt with constructively, without having, as in the past, to suffer conflict stem- 
ming from a reactive anxiety on the part of the school and home. 


Case STUDIES 


Two cases follow in which resistances were encountered and dealt with success- 
fully. The first is that of a pre-psychotic boy who was receiving play therapy as well 
as individual remedial reading. The second concerns a youngster who exhibited no 
observable abnormal processes, either in the nature of the affective controls, the 
quality of conceptualization, or the manner of relating to peers and adults. 


Vince, aged 9, tested at the borderline level of intelligence on the Wechsler Intelligence Scale for 
Children. He was considered a culturally handicapped youngster whose ability to learn was compli- 
cated by a pre-psychotic emotional orientation. A neurotic defensive attitude was employed by Vince 
to deny his deep doubts concerning his failure to learn or compete with classmates. As a result of being 
dominated, yet rejected, by his older brother who used every means possible to destroy Vince’s self- 
esteem, Vince possessed a sincere belief that he was entirely incompetent, stupid and defective. In 
defending himself against experiences which could produce anxiety, he reacted by refusing to attempt 
any productive goal-seeking activity, and in doing so, became hyperactive and facetious, clowning 
his way out of anxiety-arousing learning situations and relationships. Although overtly conforming, 
the behavior exhibited showed indirect evidences of intense hostility expressed in childish evasiveness 
and negativity. Vince’s purposely obnoxious behavior inevitably aroused the hostility of his teachers. 

As a result Vince was seen for play therapy on a once-a-week basis, and for remedial reading, two 
hours per week. Although the behavior described above did not appear during the first few hours of 
the remedial sessions, it nevertheless shortly occurred. Resistances took the form similar to the kind 
described above. Dealing with the resistances meant almost completely doing away with the mini- 
mum amount of structure involved in the initial reading sessions. Vince was read to, for approximately 
twenty minutes; simple games and other reading activities of a readiness nature were depended upon 
exclusively with the central purpose being to entertain and amuse under the guise of teaching reading. 
The purpose here was to allay anxiety by permitting the rationalization of achievement, to build re- 
lationship and stimulate identification, and to make unnecessary the employment of defensive be- 
havior as seen in the child’s approach and attitude toward new learning. When anxiety had decreased 


ee the defenses could be relaxed, and it was possible to introduce a more direct teaching 
approach. 
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Dick, aged 8, achieved a classification of low-average intelligence on the Stanford Binet L exam- 
ination. He was, at the time of referral, functioning at middle to upper first grade level in his reading. 
His work habits were characterized by extensive sounding out of words, by exclusive use of a phonetic 
approach, and a deliberate attemyt to solve rather than to read material presented to him. There was 
no indication that Dick had ever grasped the concept of reading in phrases. He was the perfect ex- 
ample of the word by word reaaer who was fascinated by the results of his analytical approach. Within 
a short period of time, on the basis of two hours per week, Dick gave evidence of being able to increase 
his word recognition abilities to the extent that he was shortly reading in meaningful phrases. He 
was able to take obvious pleasure in free reading assignments which stimulated his imagination. 

Following this initial achievement, in which his reading efficiency showed a rapid and effective 
increase, he was observed to become extremely careless in his word recognition, demonstrated a 
tendency to lose his place with more th:.n a fair degree of regularity, and to be overly concerned with 
illustrations and game materials for which he had never in the past shown a great liking. The diffi- 
culties first appearing in the oral reading were shortly transferred and extended to the word recognition 
exercises and games. When the remedial teacher became aware that the learning inefficiencies and 
growing negativity were playing an increasingly dominant role, attempts were made to utilize, extend, 
and to give a more prominent place to, the pleasurable parts of the reading hour. The remedial teacher 
spent a goodly portion of the time in reading stories to Dick. He increased and rotated the game 
material on the theory that the correlated oral reading activity permitted a greater opportunity for 
concretizing resistances and generating anxieties. Oral reading was cut to minimum and eliminated 
for quite a few sessions. After eight sessions the efforts of the teacher to counter resistance by changing 
the character of the authority-initiated routine, creating a further positive basis for relationship, and 
not extending the area of conflict by responding with anxiety, permitted a slow reintroduction of some 
of the resisted tasks, thereby gradually overcoming the learning plateau. The remedial teacher was, 
in effect, attempting to show Dick by deed resulting from empathic responsiveness, that the dis- 
re maps resistances were untenable in terms of the reality relationship; that by being able to demonstra- 

ly modify the basis upon which the teacher-pupil relationship was formed, it was possible for Dick to 
acknowledge flexibility in attitudes toward authority relationships. In short Dick was given observ- 
able emotional support for altering his distorted attitudes toward his teacher, who was originally con- 
ceived of as an accepting, but nevertheless authority, figure. 


Because a youngster’s attitudes toward adults are derived from relationships 
in school and family, it is unavoidable that the child views future adult contact in 
terms of past contacts. Experience shows that permissive conditions encourage pro- 
jection and displacement of feeling and attitude. It is therefore apparent why the 
child, anxious about his failure in school, and tending to show evidence of the denial 
of his own responsibility by developing, through rationalization, negative attitudes 
toward teacher and other authority figures who demand achievement, will, when 
permitted, overtly display related affect and attitude under conditions which permit 
their expression. In effect, learning plateaus evidenced in the remedial hour seem to 
be an expression of hostility toward authority adults, who play and who have played 
a major role in the life of the child. 


SUMMARY 


Two remedial reading cases, one of an emotionally disturbed and the other of a 
normal boy, were cited in order to show how resistances appearing in the form of 
plateaus occur and how they may be dealt with by the educational psychologist 
within the remedial reading hour. It is concluded that the teacher himself has the 
responsibility to recognize the plateau when it occurs, understand it in terms of 
emotionally related attitudes governing approaches to new learning, and attempt to 
deal with it through manipulation of the relationship and learning environment. 
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CONCEPT FORMATION OF NORMAL AND SUBNORMAL 
ADOLESCENTS ON A MODIFICATION OF THE WEIGL-GOLDSTEIN- 
SCHEERER COLOR FORM SORTING TEST 


ARNE KORSTVEDT, CHALMERS L. STACEY, AND WILLIAM F. REYNOLDS 
Syracuse University* 


PROBLEM 


In 1937, Bolles“ reported on a study of concept formation in which she used 
three groups of subjects: ten adult aments, nine adult dements, and ten normal 
children. The test used by Bolles was the Weig] Form-Color Sorting Test. She con- 
cluded that the aments tended to sort the figures on the basis of form, and also show- 
ed an inability to shift attitude voluntarily and sort the figures according to another 
concept. 

The major drawback of the Bolles study was the very small number of subjects 
included in the experiment. The major purpose of the present research was to study 
the concept formation of normal and subnormal subjects, using a relatively large 
number of subjects, on a modification of the Weigl-Goldstein-Scheerer Color Form 
Sorting Test. 

METHOD 


The subjects for the experiment were 50 normal and 75 subnormal females, 
ranging in age from 15 to 18 years. The normal girls had a mean IQ of 109. The 
subnormal subjects were divided into a moron and a borderline group. The 45 moron 
females (48-69 IQ) had a mean IQ of 60. The 30 borderline subjects (70-79 IQ) had 
a mean IQ of 74. The IQ’s of the subnormal subjects had been determined by the 
use of the Revised Stanford-Binet, Form L. The IQ’s of the normal subjects had been 
determined by the use of the Otis Self-Administering Test of Intelligence. 

The test employed in the present study was a modification of the Weigl-Gold- 
stein-Scheerer Color Form Sorting Test®’. The materials included 24 plastic fig- 
ures: six squares, six equilateral triangles, six acute right triangles, and six four-sided 
parallelograms (diamonds). One figure of each form was in each of the following 
colors: red, yellow, green, blue, black, and white. All figures were white on the re- 
verse side. The concept formation test was administered individually to every sub- 
ject. 

The testing procedure was as follows: 


1. The figures were placed before the subject in a random order, colored side 
up. The subject was asked to group the figures that seemed to belong together, and 
to state the principle of his performance. 

2. The subject was asked to sort the figures in a way different than the way he 
had previously, and to state the basis of the grouping. 


3. For those subjects who had grouped the figures by both form and color on 
trials 1 and 2, the experiment was over. Some subjects grouped by only one category 
on the first two trials. Other subjects “patterned” the figures in some form of de- 
sign. For any subjects not successful in sorting the figures by both form and color 
on the first two trials, the following procedures were followed: 

(a) If the subject grouped only by form on trials 1 and 2, the experimenter 
sorted the figures by color and explained the basis of the grouping. Then, the first 
two steps of the procedure were repeated, but the trials were called trials 3 and 4. 

(b) If the subject had sorted only by color on trials 1 and 2, the figures were 
turned white side up, eliminating the color factor. If the subject was still unable to 
sort by form, the experimenter demonstrated and explained the grouping. The sub- 
ject was then given trials 3 and 4. 


*The authors wish to express their gratitude to Dr. S. W. Bisgrove, Senior Director, Syracuse 
State School, and to Miss Catherine Heffernan, Guidance Counselor, Marcellus Central School, for 
their cooperation in this study. 
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RESULTS 


From Table 1, it can be seen that, in all but one comparison, significant differ- 
ences were obtained between the normal and subnormal subjects in their perform- 
ance on the concept formation test. The normal females were much more successful 
in sorting the figures by both color and form than were the subnormal subjects. This 
result is in harmony with the results of the Bolles study. Table 1 also reveals that 
there were no significant differences between the borderline and moron subjects. 
Both groups had equal difficulty in sorting by two concepts, in that less than 25 per 
cent of both groups sorted by both color and form on the first two trials. 


TaBLE 1. Sortinc PERFORMANCE OF ALL SuB! cTs ON THE First Two TRIALS 








Normals | Subnormals | Critical | Morons | Borderlines 
Sorting Performance N =50 N =75 Ratio | N=45 N=30 





: —_ by Both Form and 41 18 S11" 1] 7 
or 





. Sorted by One Concept only 17 2.54* 10 





(a) Sorted by Color only 7 1.56 4 
(b) Sorted by Form only y 6 


. Sorted by Neither Con- . 13 
cept (Patterned) 


. Sorted First by Color 
. Sorted First by Form 


. Stated Basis of Sorting 
(Color 1+2a) 


. Stated Basis of Sorting 
(Form 1+2b ) 























. Sorted by Form only after 
White Side Turned up 
(2a+8) 























*P is less then .05. 
**P is less than .01. 


A much higher percentage of subnormals than normals were only capable of 
sorting by one concept on trials 1 and 2. Over one half of the total group of sub- 
normals patterned the figures on the first two trials. The moron group tended some- 
what more to this unsuccessful response than the borderline group, but the difference 
was not significant. Only 6 normal subjects could not group by either concept on 
trials 1 and 2. Eight normal subjects, unable to sort by form on trials 1 and 2, grasp- 
ed the concept when the figures were turned white side up. Of the 47 subnormals 
who were unable to sort by form, only 18 of them were able to when the figures were 
turned white side up. 

It can be seen from Table 1, that the subnormal subjects tended strongly to 
sort the figures first by form. The normal subjects showed a slight, insignificant 
tendency to sort first by form. The subnormal group’s preference for form was ac- 
counted for by the borderline group, which sorted first by form by more than three 
to one. The moron group showed no such preference, sorting first by each concept 
an equal number of times. This result is in direct opposition to the findings of Bolles 
that aments tend to sort first by color. 

It can be seen from the data in Table 2, that 9 of the 50 normal subjects were 
required to take trials 3 and 4. Every one of them was able to group the figures by 
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Tasie 2. Sortina PerroRMANCE OF NorMAL, BORDERLINE, AND Moron Sussects ON TRIALS 
3 AND 4 








Subjects Who Sorted By 
IQ Group Number of Subjects Both Form and Color Critical 
Given Trials 3 and 4 N % Ratio 





Normal 9 100 





2.00* 
A? 


Borderline 23 70 





Moron 34 68 














*P is less than .05. 


both form and color following the demonstrations. Of the 23 borderline subjects 
who took trials 3 and 4, 16 of them profited from the demonstrations and sorted 
successfully by both concepts. Of the 34 moron subjects who took trials 3 and 4, 23 
of them were able to group the figures by both color and form. It is of interest to note 
that approximately the same percentage of moron and borderline subjects succeeded 
after the demonstrations. Approximately 30 per cent of each group failed on trials 
3 and 4. The difference between the normal and subnormal groups on trials 3 and 4 


was significant. There was no difference between the moron and borderline groups 
on trials 3 and 4. 


SUMMARY 


A modification of the Weigl-Goldstein-Scheerer Color Form Sorting Test was 
administered to a group of 50 normal and 75 subnormal adolescent females. The 
subnormal group was composed of 45 morons (48-69 IQ), and 30 borderlines (70-79 


IQ). The materials included 24 plastic figures in four designs and six colors. Each 
subject was given two trials to sort the figures by color and form. Subjects who were 
not successful in grouping by both concepts had the sortings demonstrated to them, 
and then were given trials 3 and 4. 

In all but one statistical comparison between the normal and sub-groups on 
trials 1 and 2, large and significant differences were found. The subjects with normal 
intelligence were much more successful in sorting the figures by both color and form. 
All statistical comparisons between the moron and borderline subjects were insig- 
nificant, indicating that both subnormal groups had about equal difficulty on the 
concept formation test. Borderline subjects showed a decided tendency to sort first 
by form, while normal and moron subjects showed no preference. All normal sub- 
jects who took trials 3 and 4 were successful in sorting by both color and form. In 
both the moron and borderline groups, approximately 70 per cent of the subjects 
who took trials 3 and 4 were able to sort by both concepts. The difference between 
the normal and subnormal groups was statistically significant. 
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INTRODUCTION 


In evaluating the utilization by the military services of the psychiatrically 
marginal individual, the major attention centers around whether or not the in- 
dividual concerned can successfully complete a stated period of military duty without 
the occurrence of a psychiatric breakdown which necessitates medical discharge from 
the service. Little attention is paid to the quality of the service rendered. In a pre- 
vious article“ the present authors studied the hospital and disciplinary records of a 
group of known psychiatrically marginal individuals who completed a three-year 
period of military duty terminating in honorable discharge. The incidence of hos- 
pitalization and disciplinary difficulty for this marginal group, however, was much 
greater than that found for a control group of normal individuals. Because of the 
importance of these findings for military manpower policy and planning, it was de- 
cided to repeat the investigation using a new group of subjects. 


PROCEDURE 


Both the experimental and control groups used in the present study consisted 
of recruits taking their basic training at the U. 8. Naval Training Center, Newport, 
R. I. during the year 1943 who were referred to the Psychiatric Unit for observa- 
tion because of suspected neuropsychiatric difficulty and who fulfilled the following 
criteria: they must have been studied by at least two staff members with unanimous 
agreement reached, either as to psychiatric diagnosis or the absence of positive find- 
ings (“normal”); they must have been adjudged capable of service and returned to 
full duty; they must subsequently have served at least three years of active duty 
terminating in honorable discharge from the service. The normal control group con- 
tained 79 subjects. The experimental groups consisted of 65 neurotics of varying 
type, 31 schizoid personalities, 28 alcoholics, 69 individuals of low intelligence 
(mental age below 12 years), and 45 asocial psychopaths. These men represent every 
available case under these diagnoses that could be found in the Unit’s records for 
1943 and that fit the somewhat exacting criteria stated above and whose subsequent 
service records were available at the time of the study. It should be stressed that the 
original diagnostic findings were entered only in the Unit’s professional records and 
not on the recruit’s permanent health record where it might prejudice his subsequent 
career in the Navy. 

The hospital and disciplinary histories of these men were then followed until 
December 31, 1945, an average period of two and one-half years. The analysis of 
hospitalization and disciplinary difficulty was made directly from the medical and 
service records of each man. These were obtained from the Naval Records Manage- 
ment Center, Garden City, Long Island. Some cases originally intended to be in- 
cluded in the sample were lost owing to the unavailability of service records, but 
checks introduced in previous studies of ours have shown that this does not entail 
any systematic bias®: *). Hospitalization was defined as any admission to a Naval 
hospital for any reason whatsoever, and disciplinary difficulty was defined as any 
court-martial, ranging in severity from a Deck Court to a General Court-martial. 


*This study is part of a larger project subsidized by the Office of Naval Research under contract 
T7onr-45011 with Northwestern University. Thanks are due Mrs. Jean Moore, and the staff of the 
Naval Records Management Center at Garden City, Long Island, for assistance in obtaining the 
medical and service records involved. The opinions expressed are those of the individual authors and 
do not represent the opinions or policy of the Naval service. 
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RESULTS 


The results are presented in Table 1, which records the percentage of each group 
who were hospitalized or in disciplinary difficulty one or more times. Previous ex- 
perience has shown that this simple percentage is an adequate measure of group 
differences, and agrees closely with other measures that might have been used such 
as total number of incidents involved, time lost from duty, ete.“ The finding of 
the original study that psychiatrically marginal individuals who successfully com- 
plete a term of Naval service nevertheless show a higher incidence of hospitalization 
than do a group of normal controls is confirmed. Inspection of Table 1 shows that 
the differences between the marginal groups and the normal group are all in the ex- 
pected direction. With the exception of the group of psychopaths, all the differences 
are statistically significant at the 5% level. 


TaBLeE 1. HOsPITALIZATION AND DISCIPLINARY ACTION IN NORMAL AND 
MarainaL Groups 








o/, 
Classification Hospitalized 





Normal controls 
Neurotics 

Schizoid personalities 
Alcoholics 

Low intelligence 
Psychopaths 














The disciplinary data also confirm the original study in each instance. With 
the exception of the neurotic group, all the marginal groups show more disciplinary 
difficulty than does the normal control group. Only for the low intelligence and 
psychopathic groups, however, were the differences significant at the 5% level. The 
difference between normals and neurotics approaches the 8% level. 

These results should not be interpreted as indicating that the marginally. ad- 
justed man cannot be utilized by the military services. They merely show that such 
men are more “expensive” to the services, and that when their utilization is de- 
manded by manpower needs it will be necessary to make provision for the added 
demands they will entail upon medical and disciplinary facilities. They also confirm 
the clinical picture of the maladjusted individual as one who, even when he is meet- 
ing the formal adjustmental standards of his group, is doing so at a greater cost to the 
group’s medical facilities and with greater friction upon its social organization than 
is his adjusted compatriot. 


SUMMARY 


A comparison of a group of marginally adjusted individuals and a control 
group of normals, both of whom successfully completed a three-year term of Naval 
service shows that despite their successful completion of service the marginal group 
had a higher incidence of hospitalization and of disciplinary difficulty. The added 
demands that such marginal groups make upon hospital and disciplinary facilities 
in the military services must be considered in manpower planning. 
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CORRELATES OF THE NON-COMMITTAL TEST-ITEM RESPONSE 
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PURPOSE 

The omission of an item on an intelligence test may indicate frank ignorance of 
the response or, in the speed situation, a judgmental evaluation of best procedures. 
Omitted items are then an intrinsic contribution to the intelligence score. On an atti- 
tude scale, however, where a zero response is utilized to indicate uncertainty, lack 
of decision, lack of information or conviction, ignorance may be only one ingredient 
with some participation of other personality factors as well. 

It is proposed to examine here the correlates of this zero or non-committal 
response, which, like the ‘don’t know” response, may be, according to Marks? an 
evasion in a stigmatized situation. 


Previous STUDIES 

Since its inception the Bernreuter Personality Inventory has been a widely used 
and easily administered instrument for the analysis of personality in its more super- 
ficial aspects. In 1946 Ellis“ observed the paucity of studies showing objective or 
se validation of the test and instead an emphasis on statistical checks and 

alances. 

That the inventory differentiates marked deviates from normals under certain 
conditions seems clear“), but except for the B 4D scale’s distinction of normals and 
psychoneurotics, the scores of the other three scales are not significantly different. 
Furthermore, the scores can be slanted at will. Kimber’ was able to produce good 
adjustment scores by specific instructions toward this end. Sartain “* found the re- 
liability of scales low, ranging from .68 to .52. Martin, “) using Bennett’s simplified 
scoring stencils, traced most of the inventory’s effectiveness to the intro-extroversion 
and dominance-submission scales. Lorge“) found an interdependence of all total 
trait scores, nor was any trait sufficiently discrete for clinical diagnosis. 

Seeking other correlates of the BPI, Sanai and Pickard“ in 1946 found that 
Rorschach patterns do not support any of the popular, psychoanalytic or clinical 
assumptions regarding the relationships between radicalism and ‘temperamental 
traits.” Dexter®? in 1938 presented correlations of radicalism and three BPI traits 
between .24 and .28. 

In his summary, Kerr“? finds the liberal more introverted and intelligent. 
Eysenck, “ in a British sample, demonstrated extroversion in liberals and those pre- 
ferring the modern school of art. 

Nor do these personality traits depend on intelligence. Ellis seems to review no 
studies in this area“). Zalman“) comparing BPI results and academic success, 
disclosed one significant ‘“‘r’’ of .23 between introversion and military science, what- 
ever this may signify about intelligence. 


PROCEDURE 


In search of conservative-liberal attitude correlations, a group of 43 adults,* 
18 to 58 years of age, attending a series of lectures in psychology at the Indiana Uni- 
versity Extension in Fort Wayne, Indiana, during the fall of 1949, provided data on 
the Hunter Test of Social Attitudes, the Bernreuter Personality Inventory, the verb- 
al section of the Bellevue-Wechsler Intelligence Test and several personal items such 
as schooling, age, number of siblings, etc. The majority were office and business per- 
sons, the rest professionals, executives and a few factory workers. 

For the group, mean age was 31 (S. D. 10.63), mean formal schooling, 13 years, 
and relatively homogeneous in this respect (S. D., 1.61). An intelligence average of 
I. Q. 114 (S. D. 10.50) placed the group somewhat above the average though allow- 
ance might be made for some possible overestimation by the Bellevue-Wechsler, as 


*This group, part of a larger group, remained anonymous in the several testing situations of the 
experiment. 
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revealed in a previous study“. Assuming the comparability of the experimental 
and the norm group on three of the BPI trait measures, the 43 subjects seemed 
normally introverted (57th percentile, S. D. 29.67); normally self-sufficient (47th 
percentile, S. D. 23.74); but more than averagely submissive, (36th percentile, 8. D. 
26.60). According to Adorno et al“? submissive groups tend toward the authoritar- 
ian characteristics of the fascist mentality with its concomitant racial biases. This 
group, though somewhat submissive and more than a little conservative in five of 
seven sections of Hunter’s test, showed unusual liberalism toward the Negro“. 


RESULTS 
TABLE 1. CoRRELATES OF SEVERAL VARIABLES AND NUMBER OF NoNn-ComMITTAL RESPONSES 








Non-Commital Liberal Conservative 
Variable Responses* Intelligence Attitude Score 





Introversion-extroversion 14 .09 .000 
Dominance-submission —.26 —.07 .04 
Self-Sufficiency -.43 04 .09 
Age -.05 
Schooling -.07 
Intelligence -.31 -40 














*The correlations of non-committal responses, age and schooling derive from a population of 69; 
of the same and intelligence, from a population of 49; of the same and Bernreuter scores, 43. 


Of a large series of correlations, most of them reported elsewhere“, Table 1 


presents only those necessary to highlight an unexpected finding. Of the relation- 
ship of conservative-liberal attitudes and the Bernreuter scale the evidence is con- 
clusively negative. Though attitude is dependent on intelligence it hardly seems to 
relate to personality characteristics as revealed by the BPI. 

Among five answer categories, the Hunter test includes the zero response for 
uncertainty, lack of decision, lack of information or conviction. If lack of decision 
and conviction are attributable to lack of information, then, according to an intel- 
ligence test such as the Bellevue-Wechsler which includes information as a sub-test, 
there should be some relationship between number of responses marked 0 and in- 
telligence. This proves to be the case. The negative correlation, —.31, is significant 
at better than the 5% level and might be higher in a more heterogeneous popula- 
tion. (The B-W IQ ranged in this group from 98 to 138.) However, though intel- 
ligence as measured by the B-W seems an influential factor, the information sub-test 
a, yields a less than significant correlation of —.24 with number of responses mar- 

ed 0. 

Another hypothesis suggests that the more intelligent might resort to the non- 
committal response as a rejection of the idea or the wording of the test statement. 
This would operate in the direction of a positive correlation, serving to mask other 
factors making for a negative correlation. 

If the information and intelligence factors contribute in a measure to this degree 
of dependence, it must be concluded nonetheless that schooling plays little part in 
producing this variable since the r of schooling and number of responses marked 0 is 
a negligible -.07. Nor does the presumed accretion of information with age reveal 
any operation in the results. 

In addition to the information and intelligence ingredients is the uncertainty 
and lack of decision involved, and this shows a fair measure of dependence on the 
degree of individual self-sufficiency. The correlation of —.43, though not high, is 
significant at better than the 1% level, and possibly implies an intrinsic relationship 
worth further investigation. 
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During historically uncertain periods, the avoidance of attitude test statements 
might indicate simple unwillingness to commit oneself on suspect topics. The ‘don’t 
know” response would then, according to Marks, be an evasion in a stigmatized 
situation. But since S in the present experimental frame remained anonymous, this 
possibility is untenable except as “evasion” is a general personality characteristic 
regardless of the stigma in the situation. 

A feeling of self-insufficiency is a basic dynamic in the avoidance pattern, 
whether of persons, activities or ideas. Any test, therefore, which includes an option 
on evasion might be expected to yield, in addition to its primary function, this extra 
clinical diagnosis. 

As Rabin and Guertin “® observe in another connection, “almost any test em- 
ployed by a skilful clinician or insightful worker may prove an ‘aid in diagnosis’.”’ 
The omitted or non-committal response, whether on an intelligence test, an attitude 
scale or on one more directly designed for personality analysis, is recommended to 
the insightful worker as worthy of consideration. 


SUMMARY 


In a series of variables concerned with personality traits, only two showed signi- 
ficant correlations: The number of non-committal responses on a conservative- 
liberal attitudes scale with intelligence, and the same with degree of self-sufficiency. 
Neither age nor length of schooling played any considerable part nor did the in- 
formation sub-test of the Bellevue-Wechsler Intelligence Test yield a significant re- 
lationship with the number of responses marked 0 despite the fact that information 
is presumably operative in determining opinion. 

More clearly manifest is the dependence of lack of certainty, decision and con- 
fidence on the degree of feelings of self-insufficiency as measured by the Bernreuter 
Personality Inventory. 

The results of any test allowing the omission of items or the choice of a non- 


committal response might then be appraised in terms of this additional personality 
facet. 
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THE RELATIONSHIP BETWEEN Z SCORES ON THE BENDER GESTALT 
AND F+% ON THE RORSCHACH* 


ROBERT H. CURNUTT, LT. (JG) MSC, USNR AND WILLIAM B, LEWIS, ENS. MSC. , USNR 
U. S. Naval Hospital, National Naval Medical Center* 


INTRODUCTION 

Recently a scoring system for the Bender Gestalt test has been developed by 
Pascal and Suttell®. By tabulating certain deviations in the reproductions of the 
designs a raw score is obtained which in turn is converted into a Z score. These 
authors believe that the higher the Z score the greater is the likelihood that we are 
dealing with a psychiatric patient, while the lower the Z score the more integrated 
is the personality adjustment of the individual. This integrative capacity of the 
organism may be thought of as a function of the ego, for we find them saying: “Tf, 
for the purposes of this discussion, we limit our definition of ego function to the ability 
to reproduce faithfully the B-G drawings as presented, then we may say that ego 
strength lies on a continuum from very low to very high B-G scores. This quantita- 
tive continuum may, then, in a very limited way, be thought of as an aspect of ego 
strength—the lower the score the greater the ego strength.” ® 

Beck on the other hand believes that F+ on the Rorschach is also a measure of 
ego strength. He says: ‘Applying this criterion in the several personality groups 
studied by means of the Rorschach test, the striking fact is that the weaker the ego, 
the more F— percepts appear. The converse also holds: the stronger the ego, the 
more F+ associations.” 

At first glance it appears that Pascal and Suttell claim to be measuring the same 
function with their Z score that Beck believes the F+% measures. One would expect 
to find an inverse relationship between these scores. That is, in a given population, 
the higher the F+%, the lower will be the Z score. It should be recognized, how- 
ever, at the outset, that these authors may be talking only about certain specific 
aspects of “tego strength,”’ or they may merely be using similar language in describ- 
ing unrelated factors. If these two “ego strengths” or “aspects of ego function” are 
essentially the same there should be a high negative correlation between these two 
sets of scores. 

The hypothesis to be tested may be stated as follows: The Bender Gestalt Z 
re measures primarily the same function (ego-strength) as does the Rorschach 

+ 40+ 
MeETHOD 

The records of 25 patients who had been hospitalized in a Naval Neuropsy- 
chiatric Unit were withdrawn from the files. The only bias in their selection was 
that they had all been given the Rorschach and Bender tests by the same exper- 
ienced clinical psychologist. This was done to insure that any individual deviation 
in the F+% scoring would be constant. The Berider records were scored according 
to the method of Pascal and Suttell. The author has found that his scoring average 
is 3.6 points above that of Pascal and Suttell who differ by 4 or 5 points between 
themselves. 

Our sample consisted of 7 patients with organic brain damage, 8 schizophrenics, 
9 neurotics, and one case of psychosomatic asthma. F+% and Z scores were corre- 
lated using the Rank Difference Method. 


RESULTS AND Discussion 
A Rank Difference coefficient (Rho) of .07 was obtained for the Z scores and 
F+% of the group. Thus the expected relationship between F+ % and Z scores was 
not realized. It appears then that they do not measure the same function in this 
mixed clinical group. Perhaps the F+% does not measure ego strength while the Z 
score does measure it, or the converse may be true. Again, the two scores may be 
measuring different, mathematically unrelated aspects of ego strength. 


*The opinions contained herein are the private ones of the writers and are not to be construed as 
official or reflecting the views of the Navy Department or the Naval Service at large. 
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A third possibility is that these scores measure different functions in discrete 
nosological groups, or even in different individuals.! The authors favor this alterna- 
tive, reasoning as follows: It is falacious to assume that a given Z score or F+% can 
be explained in terms of some constant factor which the Z or F+ % always measures. 
What is measured is a function not only “of the measuring instrument” but of what 
there is to be measured. The Bender Gestalt Z score and the Rorschach F+% vary 
in relation to an unknown number of factors. One cannot assume that the primary 
variable in a specific case is “ego strength” (or anything else) in the absence of furth- 
er knowledge of the individual tested. 


SUMMARY 


In order to determine the relationship between two reputed measures of ego 
strength, the Bender Gestalt Z score and the Rorschach F+%, these scores from a 
heterogeneous group of 25 neuropsychiatric patients were correlated, using the 
Rank Difference method. No significant relationship was found. The implications 
of this finding are discussed. 


REFERENCES 
1. a. 8. Rorschach’s Test II. A Variety of Personality Pictures. New York: Grune and Stratton, 
1947. 
2. Curnutt, R. H. The Use of the Bender Gestalt With An Alcoholic and Non-Alcoholic Popula- 


tion. J. clin. Psychol. 1953, 9, 287-290. 
3. Pascat, G. and Sutret, B. ‘The Bender-Gestalt Test. New York: Grune and Stratton, 1951. 


1The Z scores and F+% of the 7 brain-damaged patients in the sample, considered alone, yielded 
a Rho of -.58. Though not statistically significant for so small a sample, it may indicate a trend 
within this nosological group. 


CORRELATION OF CLINICAL ESTIMATES WITH TEST SCORES ON 
MENTAL ABILITY AND PERSONALITY TESTS 


J. WATSON WILSON 


Nordli, Ogan, Wilson Associates 
‘New York, N. Y. 


PROBLEM 


This study is concerned with the predictability of performance on intelligence 
and personality scales through the use of a structured, clinical interview. It raises, 
and perhaps partially answers, the question: with what degree of accuracy can a 
clinician, using a history-taking procedure, predict performance on such tests? 


Data AND METHODS 


With no particular research in view, the writer has long attempted to estimate 
the client’s test scores prior to the administration of the tests. An examination of 
case files of four client companies (a large health and accident company, a small fire 
insurance company, a large friction materials manufacturer, and a large corporation 
manufacturing fuel injection and electrical equipment) yielded 250 cases on which 
estimates were recorded for several different tests. Three tests were found to have 
been used with sufficient frequency over a two year period to warrant computing 
correlations. A wide variety of personnel were included in the group studied: engin- 
eers, salesmen, accountants, underwriters, officers, and foremen. 

Before discussing the data, it would be advisable to provide a context for the 
predictions. With each individual, the clinician took a detailed personal history. 
It consisted of job, educational, family, social, and health histories. This typically 
took from 20 to 40 minutes. 
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In taking the history certain general hypotheses with respect to the individual 
were established. At this point a personality inventory such as the Personality Re- 
cord (a short form adjustment inventory) was administered. While the evaluee was 
completing the personality inventory, significant parts of his history were isolated 
for further study and highly individual questions were prepared for a deeper explora- 
tion. From this study came further hypotheses and suggestions for testing. The 
Beckman, oral projection or parts of the T. A. T. frequently were used at this point. 

A mental ability test such as the Personnel Problems Test, Otis, Wonderlic, or 
Wechsler-Bellevue was administered. In each instance, predictions were recorded 
before the tests were given. 


RESULTS 


On the Personality Record, scores (atypical responses) and predictions were 
available for 75 cases. Figure 1 is the scattergram for these comparisons. The 
atypical responses ranged from 2 to 24, a score of 2 being at the “well adjusted” end 
of the scale and a score of 24 being extreme on the “neurotic” end. The correlation 
between predictions and scores was .81 (or,y = .038). The average scale unit differ- 
ences between the predictions and the score was 1.5. 

The psychologist underestimated test scores in 25 cases (33.3 per cent), and 
overestimated in 31 cases (41.3 per cent). 

On the Beckman, predictions were availab!« for 70 cases. Here, predictions mere- 
ly indicated the group or decile in which the sc: ¢ would fall, i. e. A4, A3 — to — 83, 
$4. The range was from S4 (the extreme on tlic - \bmissive end) to A4 (the extreme 
on the aggressive end). The correlation bet«+ 1 predictions and scores was .66 
(orgy = .067). The average of the differences 31.1. The scattergram for this 
comparison is contained in Figure 2. In 22cas (31 per cent) the psychologist un- 
derestimated aggressiveness and in 23 cases (' ‘r cent) he overestimated it. 

On the Personnel Problems Test (a short-i.1™, intelligence test) it was possible 
to calculate correlations between predictions aud scores with respect to both items 
tried and items answered correctly. Figure 3 is the scattergram for the comparison 
of predictions and scores with respect to items answered correctly. Data were avail- 
able on 160 cases. Here the range of scores was from 11 (the 17th percentile) to 34 
(the 99th percentile). The correlation between predictions and scores was .74 
(orxsy = .036). The average of the differences was 2.0. In 74 cases (46 per cent) 
the psychologist overestimated the mental ability score while he underestimated 
it in only 54 cases (34 per cent). 

Figure 4 is the scattergram for the comparison of predictions and results with 
respect to the number of items the individual would attempt. Here only 149 cases 


Figure 1 


Relationship of Predictions to Scores Figure 2 
on the “Personality Record” 
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were available. The range was from 19 (the 57th percentile) to 34 (the 99th per- 
centile). The correlation between the prediction regarding the number of items the 
individuals would try and,the number they actually did try was .65 (ory = .047). 
The average of differences was 1.7. Whereas the psychologist tended to overestimate 
the individuals’ mental ability, he tended to underestimate the number of items the 
individuals would attempt. He underestimated in 67 (45 per cent) and overestimated 


in only 35 cases (23.5 per cent). 


SUGGESTIONS AND QUESTIONS 


Research of this sort may raise more questions than it answers. However, it 
offers certain suggestions: 


1. Following a patterned interview with which he is familiar, using tests with 
which he is familiar, and administering psychometric instruments to test specific 
hypotheses, the clinician can predict with a fairly high degree of accuracy how the 
individual will perform or “score”. Actually, the correlation between the clinician’s 
prediction and the “‘score” is somewhat higher in certain instances than correlations 
between supposedly comparable tests. This, of course, raises the question, whether 
or not the clinician should rely more on his clinical judgment and less on test scores. 


2. A carefully structured interview may have greater validity with respect to 
predicting level of ability, personality structure and the like than historic studies of 
the interview would indicate. This is counter to historic studies which suggest that 
characteristics such as those examined here cannot be adequately discerned on the 
basis of an interview. 


3. Finally, the question must be raised whether the findings here can be dupli- 
cated by others. While this may be problematical, the author holds no brief either 
for his own techniques or his particular skill in the use of them. Certainly other 
studies of this sort are needed. Such studies, adequately planned, could include a 
wider variety of tests than was possible here. 
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CRITIQUE OF TRAINING PROGRAMS IN CLINICAL PSYCHOLOGY 


American universities with the cooperation of the VETERANS ADMINISTRATION 
and the EpucaTION AND TRAINING Boarp of the AMERICAN PsycHOLOGICAL Asso- 
CIATION have truly achieved a prodigious accomplishment in establishing and 
standardizing high calibre training programs in clinical psychology and counseling 
psychology. In a short period of less then 10 years, a degree of progress has been 
achieved which is truly remarkable when compared with the slower evolution of 
standardized training in other specialties. The remarks which follow therefore 
should not be construed as involving any blanket criticism or condemnation, but 
simply as an attempt to highlight certain features in which further progress in the 
near future may be hoped for. 


1. Too much didactic teaching. There appears to be too much emphasis on organized formal 
courses, lectures, symposia and theoretical discussions. Time spent on formal course re- 
quirements far overbalances clinical ward work. Students should get their theoretical 
orientation from required readings, with only interpretive comments from formal lectures. 

2. Reliance upon Authority. There seems to be a tendency to gain respectability for clinical 
psychology training programs from basking in the reflected light of “big names” and 
authorities in neighboring fields. Students spend too much time listening to theoretical 
discourses from these authorities who often tend to “ride” their own interests or schools. 

3. Preoccupation with Theory. Trainees finish their programs having invested an inordinate 
amount of time learning about theories and relatively little time in face-to-face clinical 
contacts. A more operational approach would acquaint the student more organically with 
exactly what a clinical psychologist actually does in a wide variety of situations. Many 
of the hours now given to didactic lecture work could better be devoted to supervised clin- 
ical contacts. 

4. No Psychological Ward Rounds. The most glaring defect of training programs is the 
lack of psychological ward rounds and clinical psychopathological conferences in which 
students could come face-to-face with a wide sampling of case materials under personal 
supervision of instructors who would point out important points in “bedside” teaching. 

5. Inadequate Experience with the Whole Person. Before beginning the study of more special- 
ized techniques, the student should first be placed on the wards or in the clinic, and exposed 
to all the little details of how to relate to patients, how to influence them for desired ends, 
and in general to gain an organic experience with their conditions of living. Each student 
should first become adept in face-to-face case evaluation and handling. Clinical judgment 
can only be developed in this manner. This experience might well be gained by starting 
the trainee in as a ward attendant for three month periods during summer vacations. 

6. Prematuré Introduction to Testing. In many installations, the student is introduced to 
clinical contacts first and only through testing. This approach is very harmful in that the 
student gains little first hand experience in developing his own clinical judgment and 
Py ability but is thrown into a more or less mechanical application of technical 
skills. Testing should come much later only after basic clinical skills are developed. 

7. Unbalanced training experiences. There is an unfortunate trend in the direction of over- 
—— the psychoneuroses and psychoses at the expense of other areas such as mental 
deficiency, child guidance, delinquency, marriage counseling and other problems of equal 
social importance. With respect to mental deficiency, for example, this is a field to which 
clinical psychologists can make a t contribution and also in which there is great social 
need. All training programs should include at least 6 months in an institution for mental 
defectives as well as representative clerkships in other important areas. It is not unusual 
to find graduates of current training programs who have had adequate experience only 


with neurotics and psychotics. This reflects the undue influence of mental hospitals in 
their training. 


8. Standards too Inflexible. While it is desirable for standards established by the Veterans 


Administration and the APA boards to be sufficiently rigid, there is also the undesirable 
"7 that extreme yoy may inhibit flexibility in working toward better programs. 
t is understandable that the Veterans Administration should desire that training pro; 
provide the necessary background for VA work, but too rigid an application of such 
policies will leave the trainee unbalanced for other responsibilities which he may be called 
to assume outside VA auspices. The VA and the APA should permit recognized university 
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departments to experiment with the broadening of training programs. The present situa- 
tion is comparable to that which would occur in medical education if the Army supervised 
all training in the direction of its own needs. 


9. Research vs Clinical Proficiency. To our knowledge, it has never been scientifically proven 
that the good researcher is always the good clinician, or vice versa. On the a 
clinical experience suggests the opposite. Research ability is a rare quality which sho d 
not be equated with clinical sensitivity. Clinically sensitive trainees should not be held 
back if they happen to lack research qualifications. 


In any program for evaluating clinical training facilities, there is a tendency for 
the “seal of approval’’ to gain almost a magical or mystic significance, constituting 
as it does the mark of approval of existing ‘‘authorities”. The agency seeking ap- 
proval may even tend to sacrifice its own unique potentialities for growth because of 
the need to submit to whatever standards may be in vogue. It is not beyond the 
borders of possibility to conceive of a situation where some “school” of thought 
might gain sufficient ascendancy so as to be able to dictate nationwide standards 
based on its own particular pattern. The Behaviorists succeeded in dominating 
the American scene in this manner for a few years, just as psychoanalytic influences 
have gained overwhelming influence in some parts of the contemporary psychiatric 
scene. Let us hope that hope that standards in clinical psychology remain flexible 
and eclectic in their orientation. 


F.C. T. 





LET’S BURY THE DEAD 


A visit to any drugstore will provide convincing evidence of the truism that 
obsolete medicines continue in popular use far after they have been proven worthless 
and supplanted by more valid medications. This is true not only of patent medicines 
which are kept alive by their producers for commercial reasons, but also of secondary 
medicants which may have some therapeutic value but which have been replaced by 
more potent or less toxic substances. The medical profession has attempted to differ- 
entiate between substances of differential usefulness by establishing three classi- 
fications of drugs according to their usefulness and proven validity. The U. S. 
Pharmacopeia provides a general listing of some nine or ten thousand substances for 
which some therapeutic value is known or claimed. It includes many substances 
which are now considered obsolete or have been replaced by drugs of greater potency. 
New and Nonofficial Remedies presents a compilation of claims and facts concerning 
new drugs or proprietary remedies which have only recently appeared with insuffic- 
ient evidence to establish their potency and specificity. Finally, the American Med- 
ical Association publishes a small book Useful Drugs which lists the few hundred sub- 
stances which have been clearly established to have specific value in controlling the 
causes and symptoms of disease. In Useful Drugs, such substances are listed singly 
in order to emphasize their rational usage for specific indications. In general, medi- 
cal students are not required to familiarize themselves with all the drugs in the 
U. S. Pharmacopeia, or even in New and Nonofficial Remedies, but are expected to 
concentrate on the specifics listed in Useful Drugs. 


The medical profession is thus kept informed concerning the relatively small 
number of drugs having high specificity, high potency and low toxicity. Needless 
to say, such classifications are subject to constant modification as new developments 
render old practices obsolete. In a field such as medicine where almost infinite com- 
plexity is made possible by all the permutations and combinations in which drugs 
can be prescribed, it is very desirable to have some system to separate the chaff from 
the wheat so as to permit the average practitioner to discriminate what is validly use- 
ful from all the remedies which are offered him. 
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Clinical psychology is rapidly reaching a similar stage of development in which 
hundreds of diagnostic and therapeutic methods are available to choose from. Many 
of these methods are outdated, having been proven by research results to be invalid 
or inferior to improved methods. However, such instruments continue to be used 
both in research involving faulty design and in clinical practice where wholly invalid 
predictions continue to be drawn. Poorly trained clinicians throughout the country 
are still making socially important decisions based on the results of such tests. As 
an editor, we reject each year scores of papers submitted by earnest researchers who 
spend hundreds of hours on research which is handicapped right from the start by 
use of obsolete methods. 


In order to render the coup de grace to methods which have outlived their use- 
fulness, it seems necessary to have some official pronouncement to the effect that 
they are now considered obsolete and invalid. Ideally, such a proclamation could 
well be made by the discoverer of the method in a public statement to the effect that 
its use is no longer indicated. Since it usually requires five or more years of research 
to establish the generalized validity of claims, it might be appropriate for the orig- 
inator of new techniques to issue ten and twenty year follow-up reports summarizing 
the author’s own experience. How valuable it would be to have a statement from 
John B. Watson concerning his evaluation of the evolution of Behaviorism! As far 
as we know, only a limited number of authors have had the courage to review the 
shortcomings of their discoveries or even to indicate their existence. This would 
almost seem to be an ethical obligation. 

The other alternative might be for the AMERICAN PsycHOLOGICAL ASSOCIATION 
to establish a CounciL oF STANDARDS AND VALIDITY authorized to issue periodic 
statements based on a summary of research concerning methods which are consid- 
ered valid or invalid for modern conditions of practice. Commercial firms offering 
tests or methods to the profession might be required to submit such to the Council 
in order to secure approval of standardization, validity and reliability. Eventually 
there would accumulate a list of approved tests and methods which would serve as 
a guide to less trained members of the profession by making available periodic eval- 
uations. Such a list of approved and accepted methods would also be of value to 
nonprofessional agencies seeking psychological services in aiding them to discrim- 
inate between professional proficiency and charlatanism. It would not impede basic 
research but would simply apply classificatory labels to assist in discriminating be- 
tween unfounded claims and research validation. 


y. ©... 
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Anon. Ethical Standards of Psychologists. Washington, D. C.: American Psychologi- 
cal Association, 1953, pp. 171. 


Clinical psychologists and workers in related fields should familiarize them- 
selves with the official ethical standards governing professional psychologists. 


HALPERN, Fiorence. A Clinical Approach to Children’s Rorschachs. New York: 
Grune & Stratton, 1953, pp. 270. $6.00. 


The author is Assistant Professor of Clinical Psychology, New York University 
College of Medicine and a recogtiized authority on Rorschach testing. This book 
makes an important contribution to the testing of children through its theoretical 
discussions supplemented by 40 clinical records from well selected illustrative cases. 


SarGENT, Heten D. The Insight Test. New York: Grune & Stratton, 1953, pp. 276. 
$6.75. 


Dr. Sargent is senior psychologist of the education department of the Men- 
ninger Foundation. The Insight Test is a verbal projective test first introduced in a 
monograph in 1944 and consists of a series of items or armatures presenting hypoth- 
etical social situations testing the ability of the subject to understand the motives of 
others. The scoring system is elaborated in great detail with illustrative case mat- 
erials. 


Wuitaker, Cart A. AND Matong, THomas P. The Roots of Psychotherapy. New 
York: Blakiston, 1953, pp. 236. $4.50. 


The authors are associated with the psychiatry department of the Emory Univ- 
ersity School of Medicine. The book consists of rather elementary theoretical essays 
concerning foundations, psychotherapy as process, and a brief account of some brief 


techniques in psychotherapy. It is difficult to understand what group this book is 
directed toward. 


NAUMBERG, MARGARET. Psychoneurotic Art: Its Function in Psychotherapy. New 
York: Grune & Stratton, 1953, pp. 148. $6.75. 


This monograph presents a very interesting study of the art productions of a 
20-year-old psychoneurotic girl during 34 months of art therapy. The symbolic 
patterns clearly reveal the underlying conflicts which were gradually clarified in 
association with other materials including fantasies, dreams and writings. 


Various contributors. Drug Addiction among Adolescents. New York: Blakiston, 
1953, pp. 320. 


Verbatim transcriptions of the proceedings at two conferences held on Novem- 
ber 30, 1951 and March 13 and 14, 1952 sponsored by the Committee on Public 
Health Relations of the New York Academy of Medicine. The prepared papers and 
spontaneous group discussions contain much background material for those inter- 
ested in addiction problems. 


DERNER, GorRDON F. Aspects of the Psychology of Tuberculosis. New York: Paul B. 
Hoeber, 1953, pp. 119. $3.50. 


A Psychosomatic Medicine Monograph reporting the results of comprehensive 
psychological studies made at the Tuberculosis Division of the Grasslands Hos- 
pital on 32 tubercular patients. No unique tuberculous personality patterns were 


elicited and the study also failed to demonstrate the euphoria reported in the litera- 
ture. 
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WirtrkoweEr, Eric and Russeui, Brian. Emotional Factors in Skin Disease. New 
York: Paul B. Hoeber, 1953, pp. 214. $4.00. 


A Psychosomatic Medicine Monograph presenting authoritative general dis- 
cussions of the psychosomatic approach to skin disease, the study and management 
of skin conditions, and normative and clinical data of cases showing various ab- 
normalities of the skin and skin sensations. 


Wortis, JoserxH (Ed.) Basic Problems in Psychiatry. New York: Grune & Stratton, 
1953, pp. 186. $4.50. 


A distinguished group of contributors express their opinions in rather general 
terms on the contemporary status of psychiatry, its physiological bases, psychology 
and culture, the validity of mental testing, schools of psychiatry and the psycho- 
somatic symptom. The presentation is on the level of the intelligent layman. 


Von Foerster, Hetnz (Ed.) Cybernetics. New York: Josiah Macy, Jr. Foundation, 
1953, pp. 184. $4.00. 


Proceedings of the Ninth Conference on Circular Causal and Feedback Mech- 
anisms in Biological and Social Systems sponsored by the Josiah Macy, Jr. Founda- 
tion. 


Rau, Evarne P. (Ed.) Adrenal Cortex. Proceedings of the Fourth Conference. 
New York: Josiah Macy, Jr. Foundation, 1953, pp. 165. $4.00. 


ABRAHAM, Karu. Selected Papers on Psychoanalysis. New York: Basic Books, 1953, 
pp. 527. $6.00. First American edition. 


Karpr, Fay B. The Psychology and Psychotherapy of Otto Rank. New York: Philo- 
sophical Library, 1953, pp. 129. $3.00. Historical Introduction. 


June, C. G. Two Essays on Analytic Psychology. New York: Pantheon Books, 1953, 
pp. 329. $3.75. Volume 7 of the collected works of C. G. Jung. 


Fie.tp, Minna. Patients are People. New York: Columbia University Press, 1953, 
pp. 244. $3.75. A medical-social approach to prolonged illness. 


Furess, Rosert. The Revival of Interest in the Dream. New York: International 
Universities Press, 1953, pp. 164. $3.00. A critical evaluation of post-Freudian 
psychoanalytic contributions. 


Riese, WattHER. The Conception of Disease: Its History, its Versions and its 
Nature. New York: Philosophical Library, 1953, pp. 120. $3.75. 


ALLEN, Rosert M. Introduction to the Rorschach Technique. New York: Inter- 
national Universities Press, 1953, pp. 126. $3.00. 


An elementary manual of administration and scoring for beginning students. 


PraGet, JEAN. The Child’s Conception of Number. New York: Humanities Press, 
1953, pp. 248. $5.00. 





For children of average or superior 
intelligence, with emotional and behavior problems: 


The Department of Child Psychiatry of 
The Menninger Foundation 


offers 
Intensive individual psychotherapy in a residential school 


Outpatient psychiatric and neurologic evaluation and treatment for 
children up to 18 years of age is also available. 


THE SOUTHARD SCHOOL 


J. Cotter Hirschberg, M. D., Director Topeka, Kansas, Telephone 3-6494 


THE HOWARD INK BLOT TEST 
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Because 
The Parents’ Trust is in You 


You should know the facts 
about the Devereux program 
of Education with Therapy 





XCELLENT professional facilities and 

ong experience have equipped Devereux 
Schools to give effective education with 
therapy to the boy or girl whose emotional 
disturbances block his normal ability to 
learn. 

Under the guidance of psychologists and 
psychiatrists, the staff develops a highly 
individualized program for each child. The 
group in which the child lives is chosen 
carefully to meet his social and educa- 
tional needs, providing constant compan- 
ionship and consistent stimulation towards 
natural family relationships. 

You may have occasion to advise parents 
whose child may benefit from this pro- 
gram. The Devereux Schools staff will be 
pleased to study the case carefully and offer 
a detailed report on the possiblity of util- 


izing the Devereux program of education 
with therapy. 





Please address your inquiries to 


HeLena T. Devereux, Director 
J. Curttorp Scott, M.D., Executive Director 
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